ENHANCED PROTEIN EXPRESSION IN BACILLUS



FIELD OF THE INVENTION 

The present invention provides cells that have been genetically manipulated to
have an altered capacity to produce expressed proteins. In particular, the present
invention relates to Gram-positive microorganisms, such as Bacillus species having
enhanced expression of a protein of interest, wherein one or more chromosomal genes
have been inactivated, and preferably wherein one or more chromosomal genes have
been deleted from the Bacillus chromosome. In some further embodiments, one or more 
indigenous chromosomal regions have been deleted from a corresponding wild-type 
Bacillus host chromosome. 

BACKGROUND OF THE INVENTION 

Genetic engineering has allowed the improvement of microorganisms used as
industrial bioreactors, cell factories and in food fermentations. In particular, Bacillus
species produce and secrete a large number of useful proteins and metabolites
(Zukowski, "Production of commercially valuable products," In: Doi and McGlouglin (eds.)
Biology of Bacilli: Applications to Industry, Butterworth-Heinemann, Stoneham, Mass., pp
311-337 [1992]). The most common Bacillus species used in industry are B. licheniformis,
B. amyloliquefaciens and B. subtilis. Because of their GRAS (generally recognized as
safe) status, strains of these Bacillus species are natural candidates for the production of
proteins utilized in the food and pharmaceutical industries. Important production enzymes
include α-amylases, neutral proteases, and alkaline (or serine) proteases. However, in
spite of advances in the understanding of production of proteins in Bacillus host cells, 
there remains a need for methods to increase expression of these proteins. 

SUMMARY OF THE INVENTION 

The present invention provides cells that have been genetically manipulated to 
have an altered capacity to produce expressed proteins. In particular, the present 
invention relates to Gram-positive microorganisms, such as Bacillus species having 
enhanced expression of a protein of interest, wherein one or more chromosomal genes 
have been inactivated, and preferably wherein one or more chromosomal genes have
been deleted from the Bacillus chromosome. In some further embodiments, one or more
indigenous chromosomal regions have been deleted from a corresponding wild-type 
Bacillus host chromosome. In some preferred embodiments, the present invention 
provides methods and compositions for the improved expression and/or secretion of a 
protein of interest in Bacillus. 

In particularly preferred embodiments, the present invention provides means for 
improved expression and/or secretion of a protein of interest in Bacillus. More particularly, 
in these embodiments, the present invention involves inactivation of one or more 
chromosomal genes in a Bacillus host strain, wherein the inactivated genes are not 
necessary for strain viability. One result of inactivating one or more of the chromosomal 
genes is the production of an altered Bacillus strain that is able to express a higher level of 
a protein of interest over a corresponding non-altered Bacillus host strain. 

Furthermore, in alternative embodiments, the present invention provides means for 
removing large regions of chromosomal DNA in a Bacillus host strain, wherein the deleted 
indigenous chromosomal region is not necessary for strain viability. One result of 
removing one or more indigenous chromosomal regions is the production of an altered 
Bacillus strain that is able to express a higher level of a protein of interest over a 
corresponding unaltered Bacillus strain. In some preferred embodiments, the Bacillus
host strain is a recombinant host strain comprising a polynucleotide encoding a protein of
interest. In some particularly preferred embodiments, the altered Bacillus strain is a B.
subtilis strain. As explained in detail below, deleted indigenous chromosomal regions
include, but are not limited to, prophage regions, antimicrobial (e.g., antibiotic) regions,
regulator regions, multi-contiguous single gene regions and operon regions.

In some embodiments, the present invention provides methods and compositions for
enhancing expression of a protein of interest from a Bacillus cell. In some preferred
embodiments, the methods comprise inactivating one or more chromosomal genes selected
from the group consisting of sbo, slr, ybcO, csn, spoIISA, sigB, phrC, rapA, CssS, trpA, trpB,
trpC, trpD, trpE, trpF, tdh/kbl, alsD, sigD, prpC, gapB, pckA, fbp, rocA, ycgN, ycgM, rocF, and
rocD in a Bacillus host strain to produce an altered Bacillus strain; growing the altered
Bacillus strain under suitable growth conditions; and allowing a protein of interest to be
expressed in the altered Bacillus, wherein the expression of the protein is enhanced,
compared to the corresponding unaltered Bacillus host strain. In some embodiments, the
protein of interest is a homologous protein, while in other embodiments, the protein of interest
is a heterologous protein. In some embodiments, more than one protein of interest is
produced. In some preferred embodiments, the Bacillus species is a B. subtilis strain. In yet
further embodiments, inactivation of a chromosomal gene comprises the deletion of a gene
to produce the altered Bacillus strain. In additional embodiments, inactivation of a
chromosomal gene comprises insertional inactivation. In some preferred embodiments, the
protein of interest is an enzyme. In some embodiments, the protein of interest is selected
from proteases, cellulases, amylases, carbohydrases, lipases, isomerases, transferases,
kinases and phosphatases, while in other embodiments, the protein of interest is selected
from the group consisting of antibodies, hormones and growth factors.

In yet additional embodiments, the present invention provides altered Bacillus strains
comprising the deletion of one or more chromosomal genes selected from the group of sbo,
slr, ybcO, csn, spoIISA, sigB, phrC, rapA, CssS, trpA, trpB, trpC, trpD, trpE, trpF, tdh/kbl,
alsD, sigD, prpC, gapB, pckA, fbp, rocA, ycgN, ycgM, rocF, and rocD. In some
embodiments, the altered strain is a protease producing Bacillus strain. In an alternative
embodiment, the altered Bacillus strain is a subtilisin producing strain. In yet other
embodiments, the altered Bacillus strain further comprises a mutation in a gene selected
from the group consisting of degU, degQ, degS, scoC4, spoIIE, and oppA.

In further embodiments, the present invention provides DNA constructs comprising an
incoming sequence. In some embodiments, the incoming sequence includes a selective
marker and a gene or gene fragment selected from the group consisting of sbo, slr, ybcO,
csn, spoIISA, sigB, phrC, rapA, CssS, trpA, trpB, trpC, trpD, trpE, trpF, tdh/kbl, alsD, sigD,
prpC, gapB, pckA, fbp, rocA, ycgN, ycgM, rocF, and rocD. In alternative embodiments, the
selective marker is located in between two fragments of the gene. In other embodiments, the
incoming sequence comprises a selective marker and a homology box, wherein the
homology box flanks the 5' and/or 3' end of the marker. In additional embodiments, a host
cell is transformed with the DNA construct. In further embodiments, the host cell is an E. coli
or a Bacillus cell. In some preferred embodiments, the DNA construct is chromosomally
integrated into the host cell. 
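
By way of illustration only, the arrangement of an incoming sequence in which a selective
marker is flanked by homology boxes may be sketched in Python as follows. The sketch
assumes, for simplicity, that the homology boxes are copied from the chromosome immediately
upstream and downstream of the target gene and that integration replaces everything between
the two boxes. The class and function names, the box length and the toy sequences are
hypothetical and are not taken from the specification.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class IncomingSequence:
    """Hypothetical model: a selective marker flanked by two homology boxes."""
    upstream_box: str
    marker: str
    downstream_box: str

    def assemble(self) -> str:
        # 5' homology box, then the marker, then the 3' homology box.
        return self.upstream_box + self.marker + self.downstream_box


def build_deletion_cassette(chromosome: str, gene_start: int, gene_end: int,
                            marker: str, box_len: int) -> IncomingSequence:
    """Copy box_len bases on each side of chromosome[gene_start:gene_end]."""
    if not 0 <= gene_start < gene_end <= len(chromosome):
        raise ValueError("gene coordinates fall outside the chromosome")
    if gene_start < box_len or len(chromosome) - gene_end < box_len:
        raise ValueError("not enough flanking sequence for the homology boxes")
    upstream = chromosome[gene_start - box_len:gene_start]
    downstream = chromosome[gene_end:gene_end + box_len]
    return IncomingSequence(upstream, marker, downstream)


def simulate_double_crossover(chromosome: str, cassette: IncomingSequence) -> str:
    """Toy model of integration: the span between the boxes is replaced by the marker."""
    i = chromosome.find(cassette.upstream_box)
    if i < 0:
        raise ValueError("upstream homology box not found")
    j = chromosome.find(cassette.downstream_box, i + len(cassette.upstream_box))
    if j < 0:
        raise ValueError("downstream homology box not found")
    return chromosome[:i] + cassette.assemble() + chromosome[j + len(cassette.downstream_box):]


if __name__ == "__main__":
    # Toy chromosome: flank + target gene (all G) + flank. Not a real sequence.
    toy = "AAAACCCC" + "GGGGGGGG" + "TTTTAAAA"
    cassette = build_deletion_cassette(toy, 8, 16, marker="ATGCAT", box_len=4)
    print(cassette.assemble())
    print(simulate_double_crossover(toy, cassette))
```

In this toy model the target gene is absent from the resulting string while the marker
remains; subsequent excision of the marker is not modeled.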

The present invention also provides methods for obtaining an altered Bacillus strain 
expressing a protein of interest which comprises transforming a Bacillus host cell with the
DNA construct of the present invention, wherein the DNA construct is integrated into the chromosome
of the Bacillus host cell; producing an altered Bacillus strain, wherein one or more
chromosomal genes have been inactivated; and growing the altered Bacillus strain under
suitable growth conditions for the expression of a protein of interest. In some embodiments,
the protein of interest is selected from proteases, cellulases, amylases, carbohydrases,
lipases, isomerases, transferases, kinases and phosphatases, while in other embodiments,
the protein of interest is selected from the group consisting of antibodies, hormones and
growth factors. In yet additional embodiments, the Bacillus host strain is selected from the
group consisting of B. licheniformis, B. lentus, B. subtilis, B. amyloliquefaciens, B. brevis, B.
stearothermophilus, B. alkalophilus, B. coagulans, B. circulans, B. pumilus, B. thuringiensis,
B. clausii, B. megaterium, and preferably, B. subtilis. In some embodiments, the Bacillus
host strain is a recombinant host. In yet additional embodiments, the protein of interest is
recovered. In further embodiments, the selective marker is excised from the altered Bacillus. 

The present invention further provides methods for obtaining an altered Bacillus strain 
expressing a protein of interest. In some embodiments, the method comprises transforming 
a Bacillus host cell with a DNA construct comprising an incoming sequence wherein the 
incoming sequence comprises a selective marker and a gene selected from the group 
consisting of sbo, slr, ybcO, csn, spoIISA, sigB, phrC, rapA, CssS, trpA, trpB, trpC, trpD, trpE,
trpF, tdh/kbl, alsD, sigD, prpC, gapB, pckA, fbp, rocA, ycgN, ycgM, rocF, and rocD, wherein
the DNA construct is integrated into the chromosome of the Bacillus host cell and results in
the deletion of one or more gene(s); obtaining an altered Bacillus strain; and growing the
altered Bacillus strain under suitable growth conditions for the expression of the protein of
interest.

In some alternative embodiments, the present invention provides a DNA construct 
comprising an incoming sequence, wherein the incoming sequence includes a selective
marker and a cssS gene, a cssS gene fragment or a homologous sequence thereto. In
some embodiments, the selective marker is located between two fragments of the gene. In
alternative embodiments, the incoming sequence comprises a selective marker and a
homology box wherein the homology box flanks the 5' and/or 3' end of the marker. In yet
other embodiments, a host cell is transformed with the DNA construct. In additional
embodiments, the host cell is an E. coli or a Bacillus cell. In still further embodiments, the
DNA construct is chromosomally integrated into the host cell. 

The present invention also provides methods for obtaining Bacillus subtilis strains that
demonstrate enhanced protease production. In some embodiments, the methods comprise
the steps of transforming a Bacillus subtilis host cell with a DNA construct according to the
invention; allowing homologous recombination of the DNA construct and a homologous
region of the Bacillus chromosome wherein at least one of the following genes, sbo, slr,
ybcO, csn, spoIISA, sigB, phrC, rapA, CssS, trpA, trpB, trpC, trpD, trpE, trpF, tdh/kbl, alsD,
sigD, prpC, gapB, pckA, fbp, rocA, ycgN, ycgM, rocF, and rocD, is deleted from the Bacillus
chromosome; obtaining an altered Bacillus subtilis strain; and growing the altered Bacillus
strain under conditions suitable for the expression of a protease. In some embodiments, the
protease producing Bacillus is a subtilisin producing strain. In alternative embodiments, the
protease is a heterologous protease. In additional embodiments, the protease producing
strain further includes a mutation in a gene selected from the group consisting of degU,
degQ, degS, scoC4, spoIIE, and oppA. In some embodiments, the inactivation comprises
the insertional inactivation of the gene.

The present invention further provides altered Bacillus subtilis strains comprising a
deletion of one or more chromosomal genes selected from the group consisting of sbo, slr,
ybcO, csn, spoIISA, sigB, phrC, rapA, CssS, trpA, trpB, trpC, trpD, trpE, trpF, tdh/kbl,
alsD, sigD, prpC, gapB, pckA, fbp, rocA, ycgN, ycgM, rocF, and rocD, wherein the altered
Bacillus subtilis strain is capable of expressing a protein of interest. In some
embodiments, the protein of interest is an enzyme. In some additional embodiments, the
protein of interest is a heterologous protein.

In some embodiments, the present invention provides altered Bacillus strains
comprising a deletion of one or more indigenous chromosomal regions or fragments
thereof, wherein the indigenous chromosomal region includes about 0.5 to 500 kilobases
(kb) and wherein the altered Bacillus strains have an enhanced level of expression of a
protein of interest compared to the corresponding unaltered Bacillus strains when grown
under essentially the same growth conditions. 

In yet additional embodiments, the present invention provides protease-producing
Bacillus strains which comprise at least one deletion of an indigenous chromosomal region
selected from the group consisting of a PBSX region, a skin region, a prophage 7 region, a
SPβ region, a prophage 1 region, a prophage 2 region, a prophage 3 region, a prophage 4
region, a prophage 5 region, a prophage 6 region, a PPS region, a PKS region, a yvfF-yveK 
region, a DHB region and fragments thereof. 

In further embodiments, the present invention provides methods for enhancing the 
expression of a protein of interest in Bacillus comprising: obtaining an altered Bacillus strain
produced by introducing a DNA construct including a selective marker and an inactivating
chromosomal segment into a Bacillus host strain, wherein the DNA construct is integrated
into the Bacillus chromosome resulting in the deletion of an indigenous chromosomal region
or fragment thereof from the Bacillus host cell; and growing the altered Bacillus strain under
suitable growth conditions, wherein expression of a protein of interest is greater in the altered
Bacillus strain compared to the expression of the protein of interest in the corresponding
unaltered Bacillus host cell. 

The present invention also provides methods for obtaining a protein of interest from a 
Bacillus strain comprising the steps of: transforming a Bacillus host cell with a DNA construct
which comprises a selective marker and an inactivating chromosomal segment, wherein the
DNA construct is integrated into the chromosome of the Bacillus strain and results in deletion
of an indigenous chromosomal region or fragment thereof to form an altered Bacillus strain;
culturing the altered Bacillus strain under suitable growth conditions to allow the expression
of a protein of interest; and recovering the protein of interest.

The present invention also provides a means for the use of DNA microarray data to
screen and/or identify beneficial mutations. In some particularly preferred embodiments,
these mutations involve genes selected from the group consisting of trpA, trpB, trpC, trpD,
trpE, trpF, tdh/kbl, rocA, ycgN, ycgM, rocF, and rocD. In some preferred embodiments,
these beneficial mutations are based on transcriptome evidence for the simultaneous
expression of a given amino acid biosynthetic pathway and biodegradative pathway, and/or
evidence that deletion of the degradative pathway results in a better performing strain and/or
evidence that overexpression of the biosynthetic pathway results in a better performing
strain. In additional embodiments, the present invention provides means for the use of DNA
microarray data to provide beneficial mutations. In some particularly preferred
embodiments, these mutations involve genes selected from the group consisting of trpA,
trpB, trpC, trpD, trpE, trpF, tdh/kbl, rocA, ycgN, ycgM, rocF, and rocD, when the expression of
mRNA from genes comprising an amino acid biosynthetic pathway is not balanced and
overexpression of the entire pathway provides a better performing strain than the parent (i.e.,
wild-type and/or originating) strain. Furthermore, the present invention provides means to
improve production strains through the inactivation of gluconeogenic genes. In some of
these preferred embodiments, the inactivated gluconeogenic genes are selected from the
group consisting of pckA, gapB, and fbp.
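
By way of illustration only, one way of screening expression data for the simultaneous
expression of a biosynthetic pathway and the corresponding degradative pathway may be
sketched in Python as follows. The sketch assumes that each pathway is summarized by the
mean signal of its genes and that a single fixed threshold defines "expressed"; the function
name, the threshold, the gene labels and the signal values are hypothetical placeholders and
are not data from the specification.

```python
from statistics import mean
from typing import Dict, List, Sequence, Tuple

# Maps an amino acid label to (biosynthetic genes, degradative genes).
PathwayPairs = Dict[str, Tuple[Sequence[str], Sequence[str]]]


def co_expressed_pathways(expression: Dict[str, float],
                          pathways: PathwayPairs,
                          threshold: float) -> List[str]:
    """Return amino acids whose biosynthetic AND degradative pathways both
    show a mean signal at or above the threshold (assumed criterion)."""
    flagged = []
    for amino_acid, (biosynthetic, degradative) in pathways.items():
        if not biosynthetic or not degradative:
            continue  # nothing to compare for this amino acid
        # Genes missing from the data set are treated as not expressed.
        syn = mean(expression.get(g, 0.0) for g in biosynthetic)
        deg = mean(expression.get(g, 0.0) for g in degradative)
        if syn >= threshold and deg >= threshold:
            flagged.append(amino_acid)
    return sorted(flagged)


if __name__ == "__main__":
    # Placeholder labels and invented signal values, for illustration only.
    signals = {"syn1": 8.0, "syn2": 6.0, "deg1": 7.5, "syn3": 9.0, "deg2": 1.0}
    pairs = {
        "amino_acid_X": (["syn1", "syn2"], ["deg1"]),
        "amino_acid_Y": (["syn3"], ["deg2"]),
    }
    print(co_expressed_pathways(signals, pairs, threshold=5.0))
```

A pathway pair flagged in this way would only be a candidate; whether deleting the
degradative genes or overexpressing the biosynthetic genes actually yields a better
performing strain would still have to be determined experimentally.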

The present invention provides methods for enhancing expression of a protein of 
interest from Bacillus comprising the steps of obtaining an altered Bacillus strain capable of 
producing a protein of interest, wherein the altered Bacillus strain has at least one inactivated
chromosomal gene selected from the group consisting of sbo, slr, ybcO, csn, spoIISA, sigB,
phrC, rapA, CssS, trpA, trpB, trpC, trpD, trpE, trpF, tdh/kbl, alsD, sigD, prpC, gapB, pckA,
fbp, rocA, ycgN, ycgM, rocF, and rocD, and growing the altered Bacillus strain under
conditions such that the protein of interest is expressed by the altered Bacillus strain, wherein
the expression of the protein of interest is enhanced, compared to the expression of the
protein of interest in an unaltered Bacillus host strain. In some embodiments, the protein of
interest is selected from the group consisting of homologous proteins and heterologous
proteins. In some embodiments, the protein of interest is selected from proteases, cellulases,
amylases, carbohydrases, lipases, isomerases, transferases, kinases and phosphatases,
while in other embodiments, the protein of interest is selected from the group consisting of
antibodies, hormones and growth factors. In some particularly preferred embodiments, the
protein of interest is a protease. In additional embodiments, the altered Bacillus strain is
obtained by deleting one or more chromosomal genes selected from the group consisting of
sbo, slr, ybcO, csn, spoIISA, sigB, phrC, rapA, CssS, trpA, trpB, trpC, trpD, trpE, trpF, tdh/kbl,
alsD, sigD, prpC, gapB, pckA, fbp, rocA, ycgN, ycgM, rocF, and rocD. 

The present invention also provides altered Bacillus strains obtained using the 
method described herein. In some preferred embodiments, the altered Bacillus strains
comprise a chromosomal deletion of one or more genes selected from the group consisting
of sbo, slr, ybcO, csn, spoIISA, sigB, phrC, rapA, CssS, trpA, trpB, trpC, trpD, trpE, trpF,
tdh/kbl, alsD, sigD, prpC, gapB, pckA, fbp, rocA, ycgN, ycgM, rocF, and rocD. In some
embodiments, more than one of these chromosomal genes have been deleted. In some
particularly preferred embodiments, the altered strains are B. subtilis strains. In additional
preferred embodiments, the altered Bacillus strains are protease producing strains. In some
particularly preferred embodiments, the protease is a subtilisin. In yet additional
embodiments, the subtilisin is selected from the group consisting of subtilisin 168, subtilisin
BPN', subtilisin Carlsberg, subtilisin DY, subtilisin 147, subtilisin 309 and variants thereof. In
yet further embodiments, altered Bacillus strains further comprise mutation(s) in at least one
gene selected from the group consisting of degU, degQ, degS, scoC4, spoIIE, and oppA. In
some particularly preferred embodiments, the altered Bacillus strains further comprise a
heterologous protein of interest.

The present invention also provides DNA constructs comprising at least one gene 
selected from the group consisting of sbo, slr, ybcO, csn, spoIISA, sigB, phrC, rapA, CssS,
trpA, trpB, trpC, trpD, trpE, trpF, tdh/kbl, alsD, sigD, prpC, gapB, pckA, fbp, rocA, ycgN,
ycgM, rocF, and rocD, gene fragments thereof, and homologous sequences thereto. In
some preferred embodiments, the DNA constructs comprise at least one nucleic acid
sequence selected from the group consisting of SEQ ID NO: 1, SEQ ID NO: 3, SEQ ID NO:
5, SEQ ID NO: 7, SEQ ID NO: 9, SEQ ID NO: 11, SEQ ID NO: 13, SEQ ID NO: 15, SEQ ID
NO:17, SEQ ID NO:39, SEQ ID NO:40, SEQ ID NO:42, SEQ ID NO:44, SEQ ID NO:46, SEQ
ID NO:48, SEQ ID NO:50, SEQ ID NO:37, SEQ ID NO:25, SEQ ID NO:21, SEQ ID NO:50,
SEQ ID NO:29, SEQ ID NO:23, SEQ ID NO:27, SEQ ID NO:19, SEQ ID NO:31, SEQ ID
NO:48, SEQ ID NO:46, SEQ ID NO:35, and SEQ ID NO:33. In some embodiments, the
DNA constructs further comprise at least one polynucleotide sequence encoding at least one
protein of interest.

The present invention also provides plasmids comprising the DNA constructs. In 
further embodiments, the present invention provides host cells comprising the plasmids
comprising the DNA constructs. In some embodiments, the host cells are selected from the
group consisting of Bacillus cells and E. coli cells. In some preferred embodiments, the host
cell is B. subtilis. In some particularly preferred embodiments, the DNA construct is
integrated into the chromosome of the host cell. In alternative embodiments, the DNA
construct comprises at least one gene that encodes at least one amino acid sequence
selected from the group consisting of SEQ ID NO: 2, SEQ ID NO: 4, SEQ ID NO: 6, SEQ ID
NO: 8, SEQ ID NO: 10, SEQ ID NO: 12, SEQ ID NO: 14, SEQ ID NO: 16, SEQ ID NO:18,
SEQ ID NO:41, SEQ ID NO:43, SEQ ID NO:45, SEQ ID NO:47, SEQ ID NO:49, SEQ ID
NO:51, SEQ ID NO:38, SEQ ID NO:26, SEQ ID NO:22, SEQ ID NO:57, SEQ ID NO:30, SEQ
ID NO:24, SEQ ID NO:28, SEQ ID NO:20, SEQ ID NO:32, SEQ ID NO:55, SEQ ID NO:53,
SEQ ID NO:36, and SEQ ID NO:34. In additional embodiments, the DNA constructs further 
comprise at least one selective marker, wherein the selective marker is flanked on each side 
by a fragment of the gene or homologous gene sequence thereto. 

The present invention also provides DNA constructs comprising an incoming
sequence, wherein the incoming sequence comprises a nucleic acid encoding a protein of
interest, and a selective marker flanked on each side with a homology box, wherein the
homology box includes nucleic acid sequences having 80 to 100% sequence identity to the
sequence immediately flanking the coding regions of at least one gene selected from the
group consisting of sbo, slr, ybcO, csn, spoIISA, sigB, phrC, rapA, CssS, trpA, trpB, trpC,
trpD, trpE, trpF, tdh/kbl, alsD, sigD, prpC, gapB, pckA, fbp, rocA, ycgN, ycgM, rocF, and
rocD. In some embodiments, the DNA constructs further comprise at least one nucleic acid
which flanks the coding sequence of the gene. The present invention also provides plasmids
comprising the DNA constructs. In further embodiments, the present invention provides host
cells comprising the plasmids comprising the DNA constructs. In some embodiments, the
host cells are selected from the group consisting of Bacillus cells and E. coli cells. In some
preferred embodiments, the host cell is B. subtilis. In some particularly preferred
embodiments, the DNA construct is integrated into the chromosome of the host cell. In
additional preferred embodiments, the selective marker has been excised from the host cell
chromosome. 
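
By way of illustration only, the 80 to 100% sequence identity criterion for a homology box
may be checked with a simple position-by-position comparison, sketched in Python as follows.
The sketch assumes the two sequences have already been aligned to equal length without gaps;
in practice an alignment tool would be used first. The function names and the example
sequences are hypothetical.

```python
def percent_identity(a: str, b: str) -> float:
    """Percentage of matching positions between two pre-aligned,
    equal-length sequences (no gap handling; an assumed simplification)."""
    if not a or len(a) != len(b):
        raise ValueError("sequences must be non-empty and of equal length")
    matches = sum(x == y for x, y in zip(a.upper(), b.upper()))
    return 100.0 * matches / len(a)


def qualifies_as_homology_box(candidate: str, flank: str,
                              minimum: float = 80.0) -> bool:
    """True if the candidate is at least `minimum` percent identical to the
    chromosomal sequence flanking the coding region."""
    return percent_identity(candidate, flank) >= minimum


if __name__ == "__main__":
    flank = "ACGTACGTAC"  # toy flanking sequence, not a real locus
    print(percent_identity("ACGTACGTTT", flank))
    print(qualifies_as_homology_box("ACGTACGTTT", flank))
    print(qualifies_as_homology_box("ACGTACTTTT", flank))
```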

The present invention further provides methods for obtaining an altered Bacillus strain 
with enhanced protease production comprising: transforming a Bacillus host cell with at least
one DNA construct of the present invention, wherein the protein of interest in the DNA
construct is a protease, and wherein the DNA construct is integrated into the chromosome of
the Bacillus host cell under conditions such that at least one gene is inactivated to produce
an altered Bacillus strain; and growing the altered Bacillus strain under conditions such that
enhanced protease production is obtained. In some particularly preferred embodiments, the
method further comprises recovering the protease. In alternative preferred embodiments, at
least one inactivated gene is deleted from the chromosome of the altered Bacillus strain.
The present invention also provides altered Bacillus strains produced using the methods
described herein. In some embodiments, the Bacillus host strain is selected from the group
consisting of B. licheniformis, B. lentus, B. subtilis, B. amyloliquefaciens, B. brevis, B.
stearothermophilus, B. alkalophilus, B. coagulans, B. circulans, B. pumilus, B. lautus, B.
clausii, B. megaterium, and B. thuringiensis. In some preferred embodiments, the Bacillus
host cell is B. subtilis.

The present invention also provides methods for enhancing expression of a protease
in an altered Bacillus comprising: transforming a Bacillus host cell with a DNA construct of the
present invention; allowing homologous recombination of the DNA construct and a region of
the chromosome of the Bacillus host cell, wherein at least one gene of the chromosome of
the Bacillus host cell is inactivated, to produce an altered Bacillus strain; and growing the
altered Bacillus strain under conditions suitable for the expression of the protease, wherein
the production of the protease is greater in the altered Bacillus subtilis strain compared to the
Bacillus subtilis host prior to transformation. In some preferred embodiments, the protease
is subtilisin. In additional embodiments, the protease is a recombinant protease. In yet
further embodiments, inactivation is achieved by deletion of at least one gene. In still further
embodiments, inactivation is by insertional inactivation of at least one gene. The present
invention also provides altered Bacillus strains obtained using the methods described herein.
In some embodiments, the altered Bacillus strain comprises at least one inactivated gene
selected from the group consisting of sbo, slr, ybcO, csn, spoIISA, sigB, phrC, rapA, CssS,
trpA, trpB, trpC, trpD, trpE, trpF, tdh/kbl, alsD, sigD, prpC, gapB, pckA, fbp, rocA, ycgN,
ycgM, rocF, and rocD. In some preferred embodiments, the inactivated gene has been
inactivated by deletion. In additional embodiments, the altered Bacillus strains further
comprise at least one mutation in a gene selected from the group consisting of degU, degS,
degQ, scoC4, spoIIE, and oppA. In some preferred embodiments, the mutation is
degU(Hy)32. In still further embodiments, the strain is a recombinant protease producing
strain. In some preferred embodiments, the altered Bacillus strains are selected from the
group consisting of B. licheniformis, B. lentus, B. subtilis, B. amyloliquefaciens, B. brevis, B.
stearothermophilus, B. alkalophilus, B. coagulans, B. circulans, B. pumilus, B. lautus, B.
clausii, B. megaterium, and B. thuringiensis.

The present invention also provides altered Bacillus strains comprising a deletion of 
one or more indigenous chromosomal regions or fragments thereof, wherein the indigenous
chromosomal region includes about 0.5 to 500 kb, and wherein the altered Bacillus strain has
an enhanced level of expression of a protein of interest compared to a corresponding
unaltered Bacillus strain when the altered and unaltered Bacillus strains are grown under
essentially the same growth conditions. In preferred embodiments, the altered Bacillus
strain is selected from the group consisting of B. licheniformis, B. lentus, B. subtilis, B.
amyloliquefaciens, B. brevis, B. stearothermophilus, B. alkalophilus, B. coagulans, B.
circulans, B. pumilus, B. lautus, B. clausii, B. megaterium, and B. thuringiensis. In some
preferred embodiments, the altered Bacillus strain is selected from the group consisting of
B. subtilis, B. licheniformis, and B. amyloliquefaciens. In some particularly preferred
embodiments, the altered Bacillus strain is a B. subtilis strain. In yet further embodiments,
the indigenous chromosomal region is selected from the group consisting of a PBSX
region, a skin region, a prophage 7 region, a SPβ region, a prophage 1 region, a prophage 2
region, a prophage 3 region, a prophage 4 region, a prophage 5 region,
a prophage 6 region, a PPS region, a PKS region, a YVFF-YVEK region, a DHB region and
fragments thereof. In some preferred embodiments, two indigenous chromosomal regions or
fragments thereof have been deleted. In some embodiments, the protein of interest is
selected from proteases, cellulases, amylases, carbohydrases, lipases, isomerases,
transferases, kinases and phosphatases, while in other embodiments, the protein of interest
is selected from the group consisting of antibodies, hormones and growth factors. In yet
additional embodiments, the protein of interest is a protease. In some preferred
embodiments, the protease is a subtilisin. In some particularly preferred embodiments, the
subtilisin is selected from the group consisting of subtilisin 168, subtilisin BPN', subtilisin
Carlsberg, subtilisin DY, subtilisin 147 and subtilisin 309 and variants thereof. In further
preferred embodiments, the Bacillus host is a recombinant strain. In some particularly
preferred embodiments, the altered Bacillus strains further comprise at least one mutation in
a gene selected from the group consisting of degU, degQ, degS, scoC4, spoIIE and oppA. In
some preferred embodiments, the mutation is degU(Hy)32.

The present invention further provides protease producing Bacillus strains comprising 
a deletion of an indigenous chromosomal region selected from the group consisting of a 
PBSX region, a skin region, a prophage 7 region, a SPβ region, a prophage 1 region, a
prophage 2 region, a prophage 3 region, a prophage 4 region, a prophage 5 region, a
prophage 6 region, a PPS region, a PKS region, a YVFF-YVEK region, a DHB region and
fragments thereof. In some preferred embodiments, the protease is a subtilisin. In some
embodiments, the protease is a heterologous protease. In some preferred embodiments, the
altered Bacillus strain is selected from the group consisting of B. licheniformis, B. lentus, B.
subtilis, B. amyloliquefaciens, B. brevis, B. stearothermophilus, B. alkalophilus, B. coagulans,
B. circulans, B. pumilus, B. lautus, B. clausii, B. megaterium, and B. thuringiensis. In
additional embodiments, the Bacillus strain is a B. subtilis strain.

The present invention also provides methods for enhancing the expression of a
protein of interest in Bacillus comprising: introducing a DNA construct including a selective
marker and an inactivating chromosomal segment into a Bacillus host strain, wherein the
DNA construct is integrated into the chromosome of the Bacillus host strain, resulting in the
deletion of an indigenous chromosomal region or fragment thereof from the Bacillus host cell
to produce an altered Bacillus strain; and growing the altered Bacillus strain under suitable
conditions, wherein expression of a protein of interest is greater in the altered Bacillus strain
compared to the expression of the protein of interest in a Bacillus host cell that has not been
altered. In some preferred embodiments, the methods further comprise the step of
recovering the protein of interest. In some embodiments, the methods further comprise the
step of excising the selective marker from the altered Bacillus strain. In additional
embodiments, the indigenous chromosomal region is selected from the group of regions
consisting of PBSX, skin, prophage 7, SPβ, prophage 1, prophage 2, prophage 3, prophage
4, prophage 5, prophage 6, PPS, PKS, YVFF-YVEK, DHB and fragments thereof. In further
embodiments, the altered Bacillus strain comprises deletion of at least two indigenous
chromosomal regions. In some preferred embodiments, the protein of interest is an enzyme.
In some embodiments, the protein of interest is selected from proteases, cellulases,
amylases, carbohydrases, lipases, isomerases, transferases, kinases and phosphatases,
while in other embodiments, the protein of interest is selected from the group consisting of
antibodies, hormones and growth factors. In some embodiments, the Bacillus host strain is
selected from the group consisting of B. licheniformis, B. lentus, B. subtilis, B.
amyloliquefaciens, B. brevis, B. stearothermophilus, B. clausii, B. alkalophilus, B. coagulans,
B. circulans, B. pumilus and B. thuringiensis. The present invention also provides altered
Bacillus strains produced using the methods described herein. 

The present invention also provides methods for obtaining a protein of interest 
from a Bacillus strain comprising: transforming a Bacillus host cell with a DNA construct 
comprising a selective marker and an inactivating chromosomal segment, wherein the 
DNA construct is integrated into the chromosome of the Bacillus strain resulting in deletion 
of an indigenous chromosomal region or fragment thereof, to produce an altered Bacillus 
strain, culturing the altered Bacillus strain under suitable growth conditions to allow the 
expression of a protein of interest, and recovering the protein of interest. In some 
preferred embodiments, the protein of interest is an enzyme. In some particularly 
preferred embodiments, the Bacillus host comprises a heterologous gene encoding a 
protein of interest. In additional embodiments, the Bacillus host cell is selected from the 
group consisting of B. licheniformis, B. lentus, B. subtilis, B. amyloliquefaciens, B. brevis, 
B. stearothermophilus, B. clausii, B. alkalophilus, B. coagulans, B. circulans, B. pumilus 
and B. thuringiensis. In some preferred embodiments, the indigenous chromosomal 
region is selected from the group of regions consisting of PBSX, skin, prophage 7, SPβ, 
prophage 1, prophage 2, prophage 3, prophage 4, prophage 5, prophage 6, PPS, PKS, 
YVFF-YVEK, DHB and fragments thereof. In some particularly preferred embodiments, 
the altered Bacillus strains further comprise at least one mutation in a gene selected from 
the group consisting of degU, degQ, degS, scoC4, spoIIE and oppA. In some 
embodiments, the protein of interest is an enzyme selected from the group consisting of 
proteases, cellulases, amylases, carbohydrases, lipases, isomerases, transferases, 
kinases, and phosphatases. In some particularly preferred embodiments, the enzyme is a 
protease. In some preferred embodiments, the protein of interest is an enzyme. In other 
embodiments, the protein of interest is selected from the group consisting of antibodies, 
hormones and growth factors. 

The present invention further provides methods for enhancing the expression of a 
protein of interest in Bacillus comprising: obtaining nucleic acid from at least one Bacillus cell; 
performing transcriptome DNA array analysis on the nucleic acid from said Bacillus cell to 
identify at least one gene of interest; modifying at least one gene of interest to produce a 
DNA construct; introducing the DNA construct into a Bacillus host cell to produce an altered 
Bacillus strain, wherein the altered Bacillus strain is capable of producing a protein of interest, 
under conditions such that expression of the protein of interest is enhanced as compared to 
the expression of the protein of interest in a Bacillus that has not been altered. In some 
embodiments, the protein of interest is associated with at least one biochemical pathway 
selected from the group consisting of amino acid biosynthetic pathways and biodegradative 
pathways. In some embodiments, the methods involve disabling at least one biodegradative 
pathway. In some embodiments, the biodegradative pathway is disabled due to the 
transcription of the gene of interest. However, it is not intended that the present invention be 
limited to these pathways, as it is contemplated that the methods will find use in the 
modification of other biochemical pathways within cells such that enhanced expression of a 
protein of interest results. In some particularly preferred embodiments, the Bacillus host 
comprises a heterologous gene encoding a protein of interest. In additional embodiments, 
the Bacillus host cell is selected from the group consisting of B. licheniformis, B. lentus, B. 
subtilis, B. amyloliquefaciens, B. brevis, B. stearothermophilus, B. clausii, B. alkalophilus, B. 
coagulans, B. circulans, B. pumilus and B. thuringiensis. In some embodiments, the protein 
of interest is an enzyme. In some preferred embodiments, the protein of interest is selected 
from proteases, cellulases, amylases, carbohydrases, lipases, isomerases, transferases, 
kinases and phosphatases, while in other embodiments, the protein of interest is selected 
from the group consisting of antibodies, hormones and growth factors. 

The present invention further provides methods for enhancing the expression of a 
protein of interest in Bacillus, comprising: obtaining nucleic acid containing at least one gene 
of interest from at least one Bacillus cell; fragmenting said nucleic acid; amplifying said 
fragments to produce a pool of amplified fragments comprising said at least one gene of 
interest; ligating said amplified fragments to produce a DNA construct; directly transforming 
said DNA construct into a Bacillus host cell to produce an altered Bacillus strain; culturing 
said altered Bacillus strain under conditions such that expression of said protein of interest is 
enhanced as compared to the expression of said protein of interest in a Bacillus that has not 
been altered. In some preferred embodiments, said amplifying comprises using the 
polymerase chain reaction. In some embodiments, the altered Bacillus strain comprises a 
modified gene selected from the group consisting of prpC, sigD and tdh/kbl. In some 
particularly preferred embodiments, the Bacillus host comprises a heterologous gene 
encoding a protein of interest. In additional embodiments, the Bacillus host cell is selected 
from the group consisting of B. licheniformis, B. lentus, B. subtilis, B. amyloliquefaciens, B. 
brevis, B. stearothermophilus, B. clausii, B. alkalophilus, B. coagulans, B. circulans, B. 
pumilus and B. thuringiensis. In some embodiments, the protein of interest is an enzyme. In 
some preferred embodiments, the protein of interest is selected from proteases, cellulases, 
amylases, carbohydrases, lipases, isomerases, transferases, kinases and phosphatases, 
while in other embodiments, the protein of interest is selected from the group consisting of 
antibodies, hormones and growth factors. 

The present invention further provides isolated nucleic acids comprising the 
sequences set forth in nucleic acid sequences selected from the group consisting of SEQ ID 
NO: 1, SEQ ID NO: 3, SEQ ID NO: 5, SEQ ID NO: 7, SEQ ID NO: 9, SEQ ID NO: 11, SEQ 
ID NO: 13, SEQ ID NO: 15, SEQ ID NO:39, SEQ ID NO:40, SEQ ID NO:42, SEQ ID NO:44, 
SEQ ID NO:46, SEQ ID NO:48, SEQ ID NO:50, SEQ ID NO:37, SEQ ID NO:25, SEQ ID 
NO:21, SEQ ID NO:50, SEQ ID NO:23, SEQ ID NO:27, SEQ ID NO:19, SEQ ID NO:31, SEQ 
ID NO:48, SEQ ID NO:46, SEQ ID NO:35, and SEQ ID NO:33. 

The present invention also provides isolated nucleic acid sequences encoding amino 
acids, wherein the amino acids are selected from the group consisting of SEQ ID NO: 2, SEQ 
ID NO: 4, SEQ ID NO: 6, SEQ ID NO: 8, SEQ ID NO: 10, SEQ ID NO: 12, SEQ ID NO: 14, 
SEQ ID NO: 16, SEQ ID NO:41, SEQ ID NO:43, SEQ ID NO:45, SEQ ID NO:47, SEQ ID 
NO:49, SEQ ID NO:51, SEQ ID NO:38, SEQ ID NO:26, SEQ ID NO:22, SEQ ID NO:57, SEQ 
ID NO:24, SEQ ID NO:28, SEQ ID NO:20, SEQ ID NO:32, SEQ ID NO:55, SEQ ID NO:53, 
SEQ ID NO:36, and SEQ ID NO:34. 

The present invention further provides isolated amino acid sequences, wherein the 
amino acid sequences are selected from the group consisting of SEQ ID NO: 2, SEQ ID 
NO: 4, SEQ ID NO: 6, SEQ ID NO: 8, SEQ ID NO: 10, SEQ ID NO: 12, SEQ ID NO: 14, 
SEQ ID NO: 16, SEQ ID NO:41, SEQ ID NO:43, SEQ ID NO:45, SEQ ID NO:47, SEQ ID 
NO:49, SEQ ID NO:51, SEQ ID NO:38, SEQ ID NO:26, SEQ ID NO:22, SEQ ID NO:57, 
SEQ ID NO:24, SEQ ID NO:28, SEQ ID NO:20, SEQ ID NO:32, SEQ ID NO:55, SEQ ID 
NO:53, SEQ ID NO:36, and SEQ ID NO:34. 

BRIEF DESCRIPTION OF THE DRAWINGS 

Figure 1, Panels A and B illustrate a general schematic diagram of one method 
("Method 1"; See, Example 1) provided by the present invention. In this method, flanking 
regions of a gene and/or an indigenous chromosomal region are amplified out of a wild-type 
Bacillus chromosome, cut with restriction enzymes (including at least BamHI) and 
ligated into pJM102. The construct is cloned through E. coli and the plasmid is isolated, 
linearized with BamHI and ligated to an antimicrobial marker with complementary ends. 
After cloning again in E. coli, a liquid culture is grown and used to isolate plasmid DNA for 
use in transforming a Bacillus host strain (preferably, a competent Bacillus host strain). 

Figure 2 illustrates the location of primers used in the construction of a DNA 
cassette according to some embodiments of the present invention. The diagram provides 
an explanation of the primer naming system used herein. Primers 1 and 4 are used for 
checking the presence of the deletion. These primers are referred to as "DeletionX-UF-chk" 
and "DeletionX-UR-chk-del." DeletionX-UF-chk is also used in a PCR reaction with a 
reverse primer inside the antimicrobial marker (Primer 11: called for example 
PBSX-UR-chk-Del) for a positive check of the cassette's presence in the chromosome. Primers 2 
and 6 are used to amplify the upstream flanking region. These primers are referred to as 
"DeletionX-UF" and "DeletionX-UR," and contain engineered restriction sites at the black 
vertical bars. Primers 5 and 8 are used to amplify the downstream flanking region. These 
primers are referred to as "DeletionX-DF" and "DeletionX-DR." These primers may either 
contain engineered BamHI sites for ligation and cloning, or 25 base pair tails homologous 
to an appropriate part of the Bacillus subtilis chromosome for use in PCR fusion. In some 
embodiments, primers 3 and 7 are used to fuse the cassette together in the case of those 
cassettes created by PCR fusion, while in other embodiments, they are used to check for 
the presence of the insert. These primers are referred to as "DeletionX-UF-nested" and 
"DeletionX-DR-nested." In some embodiments, the sequence corresponding to an 
"antibiotic marker" is a Spc resistance marker and the region to be deleted is the CssS 
gene. 

Figure 3 is a general schematic diagram of one method ("Method 2"; See Example 
2) of the present invention. Flanking regions are engineered to include 25 bp of sequence 
complementary to a selective marker sequence. The selective marker sequence also 
includes 25 bp tails that complement DNA of one flanking region. Primers near the ends 
of the flanking regions are used to amplify all three templates in a single reaction tube, 
thereby creating a fusion fragment. This fusion fragment or DNA construct is directly 
transformed into a competent Bacillus host strain. 
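
The overlap logic behind this kind of fusion can be illustrated with a minimal Python sketch. It is an illustration only and not the procedure of Example 2: the sequences are random placeholders, the fragment lengths are arbitrary, the helper names (random_dna, fuse) are hypothetical, and each fragment is modeled as a single top strand written 5' to 3', so that "complementary tails" appear as identical 25-character overlaps.

```python
import random

OVERLAP = 25  # tail length in bp, taken from the description of Method 2


def random_dna(length, rng):
    """Return a random DNA string; a placeholder for real chromosomal sequence."""
    return "".join(rng.choice("ACGT") for _ in range(length))


def fuse(left, right, overlap=OVERLAP):
    """Join two fragments whose adjoining ends share an identical overlap."""
    if left[-overlap:] != right[:overlap]:
        raise ValueError("fragments do not share the expected overlap")
    return left + right[overlap:]


rng = random.Random(0)
upstream = random_dna(200, rng)    # hypothetical upstream flanking region
marker = random_dna(300, rng)      # hypothetical selective marker
downstream = random_dna(200, rng)  # hypothetical downstream flanking region

# Fragment 1: upstream flank carrying a 25 bp tail matching the marker start.
frag_up = upstream + marker[:OVERLAP]
# Fragment 2: marker carrying a 25 bp tail matching the downstream flank start.
frag_marker = marker + downstream[:OVERLAP]
# Fragment 3: downstream flank, used as is.
frag_down = downstream

fusion = fuse(fuse(frag_up, frag_marker), frag_down)
print(len(fusion))  # 700 = 200 + 300 + 200, each overlap counted once
```

In this toy model the fused product is simply the upstream flank, the marker, and the downstream flank in order, which is the arrangement needed for the flanks to act as homology boxes on either side of the marker.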

Figure 4 provides an electrophoresis gel of Bacillus DHB deletion clones. Lanes 1 
and 2 depict two strains carrying the DHB deletion amplified with primers 1 and 11, and 
illustrate a 1.2 kb band amplified from upstream of the inactivating chromosomal segments 
into the phleomycin marker. Lane 3 depicts the wild-type control for this reaction. Only 
non-specific amplification is observed. Lanes 4 and 5 depict the DHB deleted strains 
amplified with primers 9 and 12. This 2 kb band amplifies through the antibiotic region to 
below the downstream section of the inactivated chromosomal segment. Lane 6 is the 
negative control for this reaction and a band is not illustrated. Lanes 7 and 8 depict the 
deletion strains amplified with primers 1 and 4 and the illustration confirms that the DHB 
region is missing. Lane 9 is the wild-type control. 

Figure 5 illustrates gel electrophoresis of two clones of a production strain of 
Bacillus subtilis (wild-type) wherein slr is replaced with a phleomycin (phleo) marker which 
results in a deletion of the slr gene. Lanes 1 and 2 represent the clones amplified with 
primers at locations 1 and 11. Lane 3 is the wild-type chromosomal DNA amplified with 
the same primers. A 1.2 kb band is observed for the insert. Lanes 4 and 5 represent the 
clones amplified with primers at locations 9 and 12. Lane 6 is the wild-type chromosomal 
DNA amplified with the same primers. Correct transformants include a 2 kb band. Lanes 7 
and 8 represent the clones amplified with primers at locations 2 and 4. Lane 9 is the 
wild-type chromosomal DNA amplified with the same primers. No band is observed for the 
deletion strains, but a band around 1 kb is observed in the wild-type. Reference is made 
to Figure 2 for an explanation of primer locations. 

Figure 6 provides an electrophoresis gel of a clone of a production strain of 
Bacillus subtilis (wild-type) wherein cssS is inactivated by the integration of a spc marker 
into the chromosome. Lane 1 is a control without the integration and is approximately 
1.5 kb smaller. 

Figure 7 provides a bar graph showing improved subtilisin secretion measured 
from shake flask cultures with Bacillus subtilis wild-type strain (unaltered) and 
corresponding altered Bacillus subtilis strains having various deletions. Protease activity 
(g/L) was measured after 17, 24 and 40 hours or was measured at 24 and 40 hours. 

Figure 8 provides a bar graph showing improved protease secretion as measured 
from shake flask cultures in Bacillus subtilis wild-type strain (unaltered) and corresponding 
altered deletion strains (-sbo) and (-slr). Protease activity (g/L) was measured after 17, 24 
and 40 hours. 

DESCRIPTION OF THE INVENTION 

The present invention provides cells that have been genetically manipulated to 
have an altered capacity to produce expressed proteins. In particular, the present 
invention relates to Gram-positive microorganisms, such as Bacillus species having 
enhanced expression of a protein of interest, wherein one or more chromosomal genes 
have been inactivated or otherwise modified. In some preferred embodiments, one or 
more chromosomal genes have been deleted from the Bacillus chromosome. In some 
further embodiments, one or more indigenous chromosomal regions have been deleted 
from a corresponding wild-type Bacillus host chromosome. 

Definitions 

All patents and publications, including all sequences disclosed within such patents 
and publications, referred to herein are expressly incorporated by reference. Unless 
defined otherwise herein, all technical and scientific terms used herein have the same 
meaning as commonly understood by one of ordinary skill in the art to which this invention 
belongs (See e.g., Singleton et al., Dictionary of Microbiology and Molecular 
Biology, 2d Ed., John Wiley and Sons, New York [1994]; and Hale and Marham, The 
Harper Collins Dictionary of Biology, Harper Perennial, NY [1991], both of which 
provide one of skill with a general dictionary of many of the terms used herein). Although 
any methods and materials similar or equivalent to those described herein can be used in 
the practice or testing of the present invention, the preferred methods and materials are 
described. Numeric ranges are inclusive of the numbers defining the range. As used 
herein and in the appended claims, the singular "a", "an" and "the" include the plural 
reference unless the context clearly dictates otherwise. Thus, for example, reference to a 
"host cell" includes a plurality of such host cells. 

Unless otherwise indicated, nucleic acids are written left to right in 5' to 3' 
orientation; amino acid sequences are written left to right in amino to carboxy orientation, 
respectively. The headings provided herein are not limitations of the various aspects or 
embodiments of the invention that can be had by reference to the specification as a whole. 
Accordingly, the terms defined immediately below are more fully defined by reference to 
the Specification as a whole. 
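
The 5' to 3' writing convention stated above can be made concrete with a short Python sketch that derives the opposite strand of a sequence, also written 5' to 3'. The function name reverse_complement and the ten-base example string are illustrative assumptions and are not taken from the specification.

```python
# Map each base to its Watson-Crick partner (upper and lower case).
COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def reverse_complement(seq):
    """Return the opposite strand of `seq`, itself written 5' to 3'."""
    return seq.translate(COMPLEMENT)[::-1]


top_strand = "ATGAATTTTC"  # arbitrary example, read 5' to 3'
print(reverse_complement(top_strand))  # GAAAATTCAT
```

Reversing after complementing is what keeps the second strand in the same left-to-right 5' to 3' orientation as the first.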

As used herein, "host cell" refers to a cell that has the capacity to act as a host or 
expression vehicle for a newly introduced DNA sequence. In preferred embodiments of 
the present invention, the host cells are Bacillus sp. or E. coli cells. 

As used herein, "the genus Bacillus" includes all species within the genus 
"Bacillus," as known to those of skill in the art, including but not limited to B. subtilis, B. 
licheniformis, B. lentus, B. brevis, B. stearothermophilus, B. alkalophilus, B. 
amyloliquefaciens, B. clausii, B. halodurans, B. megaterium, B. coagulans, B. circulans, B. 
lautus, and B. thuringiensis. It is recognized that the genus Bacillus continues to undergo 
taxonomical reorganization. Thus, it is intended that the genus include species that have 
been reclassified, including but not limited to such organisms as B. stearothermophilus, 
which is now named "Geobacillus stearothermophilus." The production of resistant 
endospores in the presence of oxygen is considered the defining feature of the genus 
Bacillus, although this characteristic also applies to the recently named Alicyclobacillus, 
Amphibacillus, Aneurinibacillus, Anoxybacillus, Brevibacillus, Filobacillus, Gracilibacillus, 
Halobacillus, Paenibacillus, Salibacillus, Thermobacillus, Ureibacillus, and Virgibacillus. 

As used herein, "nucleic acid" refers to a nucleotide or polynucleotide sequence, 
and fragments or portions thereof, as well as to DNA, cDNA, and RNA of genomic or 
synthetic origin which may be double-stranded or single-stranded, whether representing 
the sense or antisense strand. It will be understood that as a result of the degeneracy of 
the genetic code, a multitude of nucleotide sequences may encode a given protein. 
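
To give a sense of scale for this degeneracy, the following Python sketch counts how many distinct nucleotide sequences could encode a short peptide under the standard genetic code. The codon table is built from the conventional TCAG ordering; the function name count_encodings is an assumption made for illustration, and the example peptide is the first six residues of SEQ ID NO:59.

```python
from collections import Counter
from itertools import product
from math import prod

# Standard genetic code in the conventional TCAG order ("*" marks a stop codon).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Number of synonymous codons available for each amino acid.
CODONS_PER_RESIDUE = Counter(CODON_TABLE.values())


def count_encodings(peptide):
    """Count the nucleotide sequences that encode `peptide` (no stop codon)."""
    return prod(CODONS_PER_RESIDUE[aa] for aa in peptide)


print(count_encodings("MNFQTI"))  # 96 = 1 * 2 * 2 * 2 * 4 * 3
```

Because the count is a product over residues, it grows roughly exponentially with peptide length, which is why even a modest protein has an enormous number of possible coding sequences.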

As used herein the term "gene" means a chromosomal segment of DNA involved 
in producing a polypeptide chain that may or may not include regions preceding and 
following the coding regions (e.g., 5' untranslated (5' UTR) or leader sequences and 3' 
untranslated (3' UTR) or trailer sequences, as well as intervening sequence (introns) 
between individual coding segments (exons)). 

In some embodiments, the gene encodes therapeutically significant proteins or 
peptides, such as growth factors, cytokines, ligands, receptors and inhibitors, as well as 
vaccines and antibodies. The gene may encode commercially important industrial 
proteins or peptides, such as enzymes (e.g., proteases, carbohydrases such as amylases 
and glucoamylases, cellulases, oxidases and lipases). However, it is not intended that the 
present invention be limited to any particular enzyme or protein. In some embodiments, 
the gene of interest is a naturally-occurring gene, while in other embodiments, it is a 
mutated gene or a synthetic gene. 

As used herein, the term "vector" refers to any nucleic acid that can be replicated 
in cells and can carry new genes or DNA segments into cells. Thus, the term refers to a 
nucleic acid construct designed for transfer between different host cells. An "expression 
vector" refers to a vector that has the ability to incorporate and express heterologous DNA 
fragments in a foreign cell. Many prokaryotic and eukaryotic expression vectors are 
commercially available. Selection of appropriate expression vectors is within the 
knowledge of those having skill in the art. 

As used herein, the terms "DNA construct," "expression cassette," and "expression 
vector," refer to a nucleic acid construct generated recombinantly or synthetically, with a 
series of specified nucleic acid elements that permit transcription of a particular nucleic 
acid in a target cell (i.e., these are vectors or vector elements, as described above). The 
recombinant expression cassette can be incorporated into a plasmid, chromosome, 
mitochondrial DNA, plastid DNA, virus, or nucleic acid fragment. Typically, the 
recombinant expression cassette portion of an expression vector includes, among other 
sequences, a nucleic acid sequence to be transcribed and a promoter. In some 
embodiments, DNA constructs also include a series of specified nucleic acid elements 
that permit transcription of a particular nucleic acid in a target cell. In one embodiment, a 
DNA construct of the invention comprises a selective marker and an inactivating 
chromosomal segment as defined herein. 

As used herein, "transforming DNA," "transforming sequence," and "DNA 
construct" refer to DNA that is used to introduce sequences into a host cell or organism. 
The DNA may be generated in vitro by PCR or any other suitable techniques. In some 
preferred embodiments, the transforming DNA comprises an incoming sequence, while in 
other preferred embodiments it further comprises an incoming sequence flanked by 
homology boxes. In yet a further embodiment, the transforming DNA comprises other 
non-homologous sequences, added to the ends (i.e., stuffer sequences or flanks). The 
ends can be closed such that the transforming DNA forms a closed circle, such as, for 
example, insertion into a vector. 

As used herein, the term "plasmid" refers to a circular double-stranded (ds) DNA 
construct used as a cloning vector, and which forms an extrachromosomal self-replicating 
genetic element in many bacteria and some eukaryotes. In some embodiments, plasmids 
become incorporated into the genome of the host cell. 

As used herein, the terms "isolated" and "purified" refer to a nucleic acid or amino 
acid (or other component) that is removed from at least one component with which it is 
naturally associated. 

As used herein, the term "enhanced expression" is broadly construed to include 
enhanced production of a protein of interest. Enhanced expression is that expression 
above the normal level of expression in the corresponding host strain that has not been 
altered according to the teachings herein but has been grown under essentially the same 
growth conditions. 

In some preferred embodiments, "enhancement" is achieved by any modification 
that results in an increase in a desired property. For example, in some particularly 
preferred embodiments, the present invention provides means for enhancing protein 
production, such that the enhanced strains produce a greater quantity and/or quality of a 
protein of interest than the parental strain (e.g., the wild-type and/or originating strain). 

As used herein the term "expression" refers to a process by which a polypeptide is 
produced based on the nucleic acid sequence of a gene. The process includes both 
transcription and translation. 

As used herein in the context of introducing a nucleic acid sequence into a cell, the 
term "introduced" refers to any method suitable for transferring the nucleic acid sequence 
into the cell. Such methods for introduction include but are not limited to protoplast fusion, 
transfection, transformation, conjugation, and transduction (See e.g., Ferrari et al., 
"Genetics," In Harwood et al., (eds.), Bacillus, Plenum Publishing Corp., pages 57-72, 
[1989]). 

As used herein, the terms "transformed" and "stably transformed" refer to a cell 
that has a non-native (heterologous) polynucleotide sequence integrated into its genome 
or as an episomal plasmid that is maintained for at least two generations. 

As used herein "an incoming sequence" refers to a DNA sequence that is introduced 
into the Bacillus chromosome. In some preferred embodiments, the incoming sequence is 
part of a DNA construct. In preferred embodiments, the incoming sequence encodes one or 
more proteins of interest. In some embodiments, the incoming sequence comprises a 
sequence that may or may not already be present in the genome of the cell to be 
transformed (i.e., it may be either a homologous or heterologous sequence). In some 
embodiments, the incoming sequence encodes one or more proteins of interest, a gene, 
and/or a mutated or modified gene. In alternative embodiments, the incoming sequence 
encodes a functional wild-type gene or operon, a functional mutant gene or operon, or a 
non-functional gene or operon. In some embodiments, the non-functional sequence may be 
inserted into a gene to disrupt function of the gene. In some embodiments, the incoming 
sequence encodes one or more functional wild-type genes, while in other embodiments, the 
incoming sequence encodes one or more functional mutant genes, and in yet additional 
embodiments, the incoming sequence encodes one or more non-functional genes. In 
another embodiment, the incoming sequence encodes a sequence that is already present in 
the chromosome of the host cell to be transformed. In a preferred embodiment, the incoming 
sequence comprises a gene selected from the group consisting of sbo, slr, ybcO, csn, 
spoIISA, phrC, sigB, rapA, CssS, trpA, trpB, trpC, trpD, trpE, trpF, tdh/kbl, alsD, sigD, prpC, 
gapB, pckA, fbp, rocA, ycgN, ycgM, rocF, and rocD, and fragments thereof. In yet another 
embodiment, the incoming sequence includes a selective marker. In a further embodiment 
the incoming sequence includes two homology boxes. 

In some embodiments, the incoming sequence encodes at least one heterologous 
protein including, but not limited to hormones, enzymes, and growth factors. In another 
embodiment, the enzyme includes, but is not limited to hydrolases, such as protease, 
esterase, lipase, phenol oxidase, permease, amylase, pullulanase, cellulase, glucose 
isomerase, laccase and protein disulfide isomerase. 

As used herein, "homology box" refers to a nucleic acid sequence, which is 
homologous to a sequence in the Bacillus chromosome. More specifically, a homology 
box is an upstream or downstream region having between about 80 and 100% sequence 
identity, between about 90 and 100% sequence identity, or between about 95 and 100% 
sequence identity with the immediate flanking coding region of a gene or part of a gene to 
be inactivated according to the invention. These sequences direct where in the Bacillus 
chromosome a DNA construct is integrated and direct what part of the Bacillus 
chromosome is replaced by the incoming sequence. While not meant to limit the 
invention, a homology box may include between about 1 base pair (bp) and 200 kilobases 
(kb). Preferably, a homology box includes between about 1 bp and 10.0 kb; between 1 bp 
and 5.0 kb; between 1 bp and 2.5 kb; between 1 bp and 1.0 kb, and between 0.25 kb and 
2.5 kb. A homology box may also include about 10.0 kb, 5.0 kb, 2.5 kb, 2.0 kb, 1.5 kb, 1.0 
kb, 0.5 kb, 0.25 kb and 0.1 kb. In some embodiments, the 5' and 3' ends of a selective 
marker are flanked by a homology box wherein the homology box comprises nucleic acid 
sequences immediately flanking the coding region of the gene. 
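
The sequence identity thresholds in this definition can be illustrated with a minimal Python sketch. It assumes two already aligned sequences of equal length and counts matching positions; real comparisons would normally use an alignment tool. The function names, the 80% default, and the two 20-base example strings are illustrative assumptions and do not come from the specification.

```python
def percent_identity(seq_a, seq_b):
    """Percent of positions that match between two pre-aligned sequences."""
    if len(seq_a) != len(seq_b) or not seq_a:
        raise ValueError("sequences must be non-empty and of equal length")
    matches = sum(a == b for a, b in zip(seq_a.upper(), seq_b.upper()))
    return 100.0 * matches / len(seq_a)


def within_identity_range(candidate, target, minimum=80.0):
    """Check a candidate against the lower bound of an identity range."""
    return percent_identity(candidate, target) >= minimum


target = "ATGGCTAAGCTTGACCTGAA"     # hypothetical chromosomal flank (20 bases)
candidate = "ATGGCTAAGCTTGACCTGTT"  # differs from the target at two positions

print(percent_identity(candidate, target))             # 90.0
print(within_identity_range(candidate, target, 95.0))  # False
```

Here the candidate would fall inside the 80 to 100% and 90 to 100% ranges named above but outside the 95 to 100% range.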

As used herein, the term "selectable marker-encoding nucleotide sequence" refers 
to a nucleotide sequence which is capable of expression in the host cells and where 
expression of the selectable marker confers to cells containing the expressed gene the 
ability to grow in the presence of a corresponding selective agent or lack of an essential 
nutrient. 

As used herein, the terms "selectable marker" and "selective marker" refer to a 
nucleic acid (e.g., a gene) capable of expression in a host cell which allows for ease of 
selection of those hosts containing the vector. Examples of such selectable markers 
include but are not limited to antimicrobials. Thus, the term "selectable marker" refers to 
genes that provide an indication that a host cell has taken up an incoming DNA of interest 
or some other reaction has occurred. Typically, selectable markers are genes that confer 
antimicrobial resistance or a metabolic advantage on the host cell to allow cells containing 
the exogenous DNA to be distinguished from cells that have not received any exogenous 
sequence during the transformation. A "residing selectable marker" is one that is located 
on the chromosome of the microorganism to be transformed. A residing selectable marker 
encodes a gene that is different from the selectable marker on the transforming DNA 
construct. Selective markers are well known to those of skill in the art. As indicated 
above, preferably the marker is an antimicrobial resistant marker (e.g., amp^R; phleo^R; 
spec^R; kan^R; ery^R; tet^R; cmp^R; and neo^R; See e.g., Guerot-Fleury, Gene, 167:335-337 
[1995]; Palmeros et al., Gene 247:255-264 [2000]; and Trieu-Cuot et al., Gene, 23:331-341 
[1983]). In some particularly preferred embodiments, the present invention provides a 
chloramphenicol resistance gene (e.g., the gene present on pC194, as well as the 
resistance gene present in the Bacillus licheniformis genome). This resistance gene is 
particularly useful in the present invention, as well as in embodiments involving 
chromosomal amplification of chromosomally integrated cassettes and integrative 
plasmids (See e.g., Albertini and Galizzi, Bacteriol., 162:1203-1211 [1985]; and Stahl and 
Ferrari, J. Bacteriol., 158:411-418 [1984]). The DNA sequence of this naturally-occurring 
chloramphenicol resistance gene is shown below: 

ATGAATTTTCAAACAATCGAGCTTGACACATGGTATAGAAAATCTTATTTTGACCATTA 

CATGAAGGAAGCGAAATGTTCTTTCAGCATCACGGCAAACGTCAATGTGACAAATTTG 

CTCGCCGTGCTCAAGAAAAAGAAGCTCAAGCTGTATCCGGCTTTTATTTATATCGTAT 

CAAGGGTCATTCATTCGCGCCCTGAGTTTAGAACAACGTTTGATGACAAAGGAAGCT 

GGGTTATTGGGAACAAATGCATCCGTGCTATGCGATTTTTCATCAGGACGACGAAAC 

GTTTTCCGCCGTCTGGACGGAATACTCAGACGATTTTTCGCAGTTTTATCATCAATAT 
CTTCTGGACGCCGAGCGCTTTGGAGACAAAAGGGGCCTTTGGGCTAAGCCGGACAT 
CCCGCCCAATACGmTCAGmCTTCTATTCCATGGGTGCGCmTCAACATTCAATT 
TAAACCTTGATAACAGCGAACACTTGCTGCCGATTATTACAAACGGGAAATACTTTTC 
AGAAGGCAGGGAAACATTTTTGCCCGTTTCCTGCAAGTTCACCATGCAGTGTGTGAC 
GGCTATCATGCCGGCGCTTTTATAA(SEQ ID NO:58). 

The deduced amino acid sequence of this chloramphenicol resistance protein is: 

MNFQTIELDTWYRKSYFDHYMKEAKCSFSITANVNVTNLLAVLKKKKLKLYPAFIYIVSRVI 
HSRPEFRTTFDDKGQLGYWEQMHPCYAIFHQDDQTFSALWTEYSDDFSQFYHQYLLDA 
ERFGDKRGLWAKPDIPPNTFSVSSIPWVRFSTFNLNLDNSEHLLPIITNGKYFSEGRETFL 
PVSCKFTMQCVTAIMPALL (SEQ ID NO:59). 
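
The relationship between the nucleotide sequence and its deduced amino acid sequence can be checked with a short Python sketch. It is a minimal illustration that assumes the standard genetic code and translates only the first 18 nucleotides of SEQ ID NO:58 as printed above; the function name translate is an assumption made for the example.

```python
from itertools import product

# Standard genetic code in the conventional TCAG order ("*" marks a stop codon).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate a 5' to 3' coding sequence, stopping at the first stop codon."""
    dna = dna.upper()
    residues = []
    # Ignore any trailing bases that do not form a complete codon.
    for i in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE[dna[i:i + 3]]
        if residue == "*":
            break
        residues.append(residue)
    return "".join(residues)


prefix = "ATGAATTTTCAAACAATC"  # first 18 nucleotides of SEQ ID NO:58
print(translate(prefix))  # MNFQTI
```

The result matches the first six residues of SEQ ID NO:59, consistent with that sequence being the translation of SEQ ID NO:58 read from its first ATG.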

Other markers useful in accordance with the invention include, but are not limited 
to auxotrophic markers, such as tryptophan; and detection markers, such as 
galactosidase. 

As used herein, the term "promoter" refers to a nucleic acid sequence that 
functions to direct transcription of a downstream gene. In preferred embodiments, the 
promoter is appropriate to the host cell in which the target gene is being expressed. The 
promoter, together with other transcriptional and translational regulatory nucleic acid 
sequences (also termed "control sequences") is necessary to express a given gene. In 
general, the transcriptional and translational regulatory sequences include, but are not 
limited to, promoter sequences, ribosomal binding sites, transcriptional start and stop 
sequences, translational start and stop sequences, and enhancer or activator sequences. 

A nucleic acid is "operably linked" when it is placed into a functional relationship 
with another nucleic acid sequence. For example, DNA encoding a secretory leader (i.e., 
a signal peptide), is operably linked to DNA for a polypeptide if it is expressed as a 
preprotein that participates in the secretion of the polypeptide; a promoter or enhancer is 
operably linked to a coding sequence if it affects the transcription of the sequence; or a 
ribosome binding site is operably linked to a coding sequence if it is positioned so as to 
facilitate translation. Generally, "operably linked" means that the DNA sequences being 
linked are contiguous, and, in the case of a secretory leader, contiguous and in reading 
phase. However, enhancers do not have to be contiguous. Linking is accomplished by 
ligation at convenient restriction sites. If such sites do not exist, synthetic 
oligonucleotide adaptors or linkers are used in accordance with conventional practice. 

The term "inactivation" includes any method that prevents the functional 
expression of one or more of the sbo, slr, ybcO, csn, spoIISA, sigB, phrC, rapA, CssS, 
trpA, trpB, trpC, trpD, trpE, trpF, tdh/kbl, alsD, sigD, prpC, gapB, pckA, fbp, rocA, ycgN, 
ycgM, rocF, and rocD chromosomal genes, wherein the gene or gene product is unable to 
exert its known function. Inactivation or enhancement occurs via any suitable means, 
including deletions, substitutions (e.g., mutations), interruptions, and/or insertions in the 
nucleic acid gene sequence. In one embodiment, the expression product of an inactivated 
gene is a truncated protein with a corresponding change in the biological activity of the 
protein. In some embodiments, the change in biological activity is an increase in activity, 
while in preferred embodiments, the change results in the loss of biological activity. In 
some embodiments, an altered Bacillus strain comprises inactivation of one or more 
genes that results preferably in stable and non-reverting inactivation. 

In some preferred embodiments, inactivation is achieved by deletion. In some 
preferred embodiments, the gene is deleted by homologous recombination. For example, 
in some embodiments when sbo is the gene to be deleted, a DNA construct comprising an 
incoming sequence having a selective marker flanked on each side by a homology box is 
used. The homology box comprises nucleotide sequences homologous to the nucleic acid 
regions flanking the chromosomal sbo gene. The DNA construct aligns with the 
homologous sequences of the Bacillus host chromosome and in a double crossover event 
the sbo gene is excised out of the host chromosome. 
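
The double crossover described above can be illustrated with a minimal sketch. It assumes a toy chromosome represented as a text string, treats homology as exact string identity, and assumes each homology box occurs only once; the function name and all sequences are hypothetical and serve only as an illustration.

```python
def double_crossover(chromosome: str, box_5p: str, incoming: str, box_3p: str) -> str:
    """Replace the chromosomal segment lying between two homology boxes.

    Toy model only: real recombination depends on sequence similarity and
    box length, neither of which is modeled here.
    """
    start = chromosome.find(box_5p)
    if start == -1:
        raise ValueError("5' homology box not found in chromosome")
    after_5p = start + len(box_5p)
    end = chromosome.find(box_3p, after_5p)
    if end == -1:
        raise ValueError("3' homology box not found downstream of the 5' box")
    # The segment between the boxes (the target gene) is excised and the
    # incoming sequence (e.g., a selective marker) takes its place.
    return chromosome[:after_5p] + incoming + chromosome[end:]


# Hypothetical sequences: "GENE" stands in for the sbo coding region.
chromosome = "aaaaUPBOXGENEDNBOXtttt"
result = double_crossover(chromosome, "UPBOX", "MARKER", "DNBOX")
print(result)  # aaaaUPBOXMARKERDNBOXtttt
```

Passing an empty incoming sequence models a construct that carries homology boxes only, in which case the segment between the boxes is simply deleted.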

As used herein, "deletion" of a gene refers to deletion of the entire coding 
sequence, deletion of part of the coding sequence, or deletion of the coding sequence 
including flanking regions. The deletion may be partial as long as the sequences left in 
the chromosome provide the desired biological activity of the gene. The flanking regions 
of the coding sequence may include from about 1 bp to about 500 bp at the 5' and 3' ends. 
The flanking region may be larger than 500 bp but will preferably not include other genes 
in the region which may be inactivated or deleted according to the invention. The end 
result is that the deleted gene is effectively non-functional. In simple terms, a "deletion" is 
defined as a change in either nucleotide or amino acid sequence in which one or more 
nucleotides or amino acid residues, respectively, have been removed (i.e., are absent). 
Thus, a "deletion mutant" has fewer nucleotides or amino acids than the respective wild- 
type organism. 

In still another embodiment of the present invention, deletion of a gene active at an 
inappropriate time as determined by DNA array analysis (e.g., transcriptome analysis, as 
described herein) provides enhanced expression of a product protein. In some preferred 
embodiments, deletion of one or more genes selected from the group consisting of 
pckA, gapB, fbp, and alsD provides a strain with improved efficiency of 
feed utilization. As used herein, "transcriptome analysis" refers to the analysis of gene 
transcription. 

In another embodiment of the present invention, a gene is considered to be 
"optimized" by the deletion of a regulatory sequence, in which this deletion results in 
increased expression of a desired product. In some preferred embodiments of the present 
invention, the tryptophan operon (i.e., comprising genes trpA, trpB, trpC, trpD, trpE, trpF) 
is optimized by the deletion of the DNA sequence coding for the TRAP-binding RNA 
sequence (See, Yang et al., J. Mol. Biol., 270:696-710 [1997]). This deletion is 
contemplated to increase expression of the desired product from the host strain. 

In another preferred embodiment, inactivation is by insertion. For example, in 
some embodiments, when sbo is the gene to be inactivated, a DNA construct comprises 
an incoming sequence having the sbo gene interrupted by a selective marker. The 
selective marker will be flanked on each side by sections of the sbo coding sequence. The 
DNA construct aligns with essentially identical sequences of the sbo gene in the host 
chromosome and in a double crossover event the sbo gene is inactivated by the insertion 
of the selective marker. In simple terms, an "insertion" or "addition" is a change in a 
nucleotide or amino acid sequence which has resulted in the addition of one or more 
nucleotides or amino acid residues, respectively, as compared to the naturally occurring 
sequence. 

In another embodiment, inactivation is by insertion in a single crossover event with a 
plasmid as the vector. For example, a sbo chromosomal gene is aligned with a plasmid 
comprising the gene or part of the gene coding sequence and a selective marker. In some 
embodiments, the selective marker is located within the gene coding sequence or on a 
part of the plasmid separate from the gene. The vector is integrated into the Bacillus 
chromosome, and the gene is inactivated by the insertion of the vector in the coding 
sequence. 

In alternative embodiments, inactivation results due to mutation of the gene. 
Methods of mutating genes are well known in the art and include but are not limited to site- 
directed mutation, generation of random mutations, and gapped-duplex approaches (See 
e.g., U.S. Pat. 4,760,025; Moring et al., Biotech. 2:646 [1984]; and Kramer et al., Nucleic 
Acids Res., 12:9441 [1984]). 

As used herein, a "substitution" results from the replacement of one or more 
nucleotides or amino acids by different nucleotides or amino acids, respectively. 

As used herein, "homologous genes" refers to a pair of genes from different, but 
usually related species, which correspond to each other and which are identical or very 
similar to each other. The term encompasses genes that are separated by speciation (i.e., 
the development of new species) (e.g., orthologous genes), as well as genes that have 
been separated by genetic duplication (e.g., paralogous genes). 

As used herein, "ortholog" and "orthologous genes" refer to genes in different 
species that have evolved from a common ancestral gene (i.e., a homologous gene) by 
speciation. Typically, orthologs retain the same function during the course of evolution. 
Identification of orthologs finds use in the reliable prediction of gene function in newly 
sequenced genomes. 

As used herein, "paralog" and "paralogous genes" refer to genes that are related 
by duplication within a genome. While orthologs retain the same function through the 
course of evolution, paralogs evolve new functions, even though some functions are often 
related to the original one. Examples of paralogous genes include, but are not limited to 
genes encoding trypsin, chymotrypsin, elastase, and thrombin, which are all serine 
proteinases and occur together within the same species. 

As used herein, "homology" refers to sequence similarity or identity, with identity 
being preferred. This homology is determined using standard techniques known in the art 
(See e.g., Smith and Waterman, Adv. Appl. Math., 2:482 [1981]; Needleman and 
Wunsch, J. Mol. Biol., 48:443 [1970]; Pearson and Lipman, Proc. Natl. Acad. Sci. USA 
85:2444 [1988]; programs such as GAP, BESTFIT, FASTA, and TFASTA in the Wisconsin 
Genetics Software Package (Genetics Computer Group, Madison, WI); and Devereux et 
al., Nucl. Acid Res., 12:387-395 [1984]). 

As used herein, an "analogous sequence" is one wherein the function of the gene 
is essentially the same as the gene designated from Bacillus subtilis strain 168. 
Additionally, analogous genes have at least 60%, 65%, 70%, 75%, 80%, 85%, 90%, 
95%, 97%, 98%, 99% or 100% sequence identity with the sequence of the Bacillus subtilis 
strain 168 gene. Alternatively, analogous sequences have an alignment of between 70 to 
100% of the genes found in the B. subtilis 168 region and/or have at least between 5-10 
genes found in the region aligned with the genes in the B. subtilis 168 chromosome. In 
additional embodiments more than one of the above properties applies to the sequence. 
Analogous sequences are determined by known methods of sequence alignment. A 
commonly used alignment method is BLAST, although as indicated above and below, 
there are other methods that also find use in aligning sequences. 

One example of a useful algorithm is PILEUP. PILEUP creates a multiple 
sequence alignment from a group of related sequences using progressive, pairwise 
alignments. It can also plot a tree showing the clustering relationships used to create the 
alignment. PILEUP uses a simplification of the progressive alignment method of Feng and 
Doolittle (Feng and Doolittle, J. Mol. Evol., 35:351-360 [1987]). The method is similar to 
that described by Higgins and Sharp (Higgins and Sharp, CABIOS 5:151-153 [1989]). 
Useful PILEUP parameters include a default gap weight of 3.00, a default gap length 
weight of 0.10, and weighted end gaps. 

Another example of a useful algorithm is the BLAST algorithm, described by 
Altschul et al. (Altschul et al., J. Mol. Biol., 215:403-410 [1990]; and Karlin et al., Proc. 
Natl. Acad. Sci. USA 90:5873-5787 [1993]). A particularly useful BLAST program is the 
WU-BLAST-2 program (See, Altschul et al., Meth. Enzymol., 266:460-480 [1996]). WU- 
BLAST-2 uses several search parameters, most of which are set to the default values. 
The adjustable parameters are set with the following values: overlap span = 1, overlap 
fraction = 0.125, word threshold (T) = 11. The HSP S and HSP S2 parameters are 
dynamic values and are established by the program itself depending upon the composition 
of the particular sequence and composition of the particular database against which the 
sequence of interest is being searched. However, the values may be adjusted to increase 
sensitivity. A % amino acid sequence identity value is determined by the number of 
matching identical residues divided by the total number of residues of the "longer" 
sequence in the aligned region. The "longer" sequence is the one having the most actual 
residues in the aligned region (gaps introduced by WU-BLAST-2 to maximize the alignment 
score are ignored). 
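
The identity calculation just described can be sketched as follows. This is a minimal illustration that assumes the two sequences have already been aligned (by WU-BLAST-2 or any other tool) and that gaps are written as "-"; the function name and the example sequences are hypothetical.

```python
def percent_identity(aligned_a: str, aligned_b: str, gap: str = "-") -> float:
    """Percent identity relative to the "longer" sequence in the aligned region.

    Matching identical residues are divided by the residue count of the
    sequence having the most actual residues; gap characters are ignored.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    matches = sum(1 for x, y in zip(aligned_a, aligned_b) if x == y and x != gap)
    residues_a = sum(1 for x in aligned_a if x != gap)
    residues_b = sum(1 for y in aligned_b if y != gap)
    longer = max(residues_a, residues_b)
    if longer == 0:
        raise ValueError("aligned region contains no residues")
    return 100.0 * matches / longer


# Hypothetical aligned region: 9 identical positions, 10 residues per sequence.
print(percent_identity("MNFQTIEL-DT", "MNFQ-IELADT"))  # 90.0
```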

Thus, "percent (%) nucleic acid sequence identity" is defined as the percentage of 
nucleotide residues in a candidate sequence that are identical with the nucleotide residues 
of the sequence shown in the nucleic acid figures. A preferred method utilizes the 
BLASTN module of WU-BLAST-2 set to the default parameters, with overlap span and 
overlap fraction set to 1 and 0.125, respectively. 

The alignment may include the introduction of gaps in the sequences to be aligned. 
In addition, for sequences which contain either more or fewer nucleosides than those of 
the nucleic acid figures, it is understood that the percentage of homology will be 
determined based on the number of homologous nucleosides in relation to the total 
number of nucleosides. Thus, for example, homology of sequences shorter than those of 
the sequences identified herein and as discussed below, will be determined using the 
number of nucleosides in the shorter sequence. 

As used herein, the term "hybridization" refers to the process by which a strand of 
nucleic acid joins with a complementary strand through base pairing, as known in the art. 

A nucleic acid sequence is considered to be "selectively hybridizable" to a 
reference nucleic acid sequence if the two sequences specifically hybridize to one another 
under moderate to high stringency hybridization and wash conditions. Hybridization 
conditions are based on the melting temperature (Tm) of the nucleic acid binding complex 
or probe. For example, "maximum stringency" typically occurs at about Tm-5°C (5° below 
the Tm of the probe); "high stringency" at about 5-10°C below the Tm; "intermediate 
stringency" at about 10-20°C below the Tm of the probe; and "low stringency" at about 20- 
25°C below the Tm. Functionally, maximum stringency conditions may be used to identify 
sequences having strict identity or near-strict identity with the hybridization probe; while an 
intermediate or low stringency hybridization can be used to identify or detect 
polynucleotide sequence homologs. 
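
The stringency ranges above can be expressed as a simple lookup. The sketch below is illustrative only: the text gives approximate, touching ranges, so the assignment of boundary values (for example, exactly 10°C below the Tm counted as "high") is an assumption made here, and the function name is hypothetical.

```python
def stringency(tm_c: float, hyb_temp_c: float) -> str:
    """Classify hybridization stringency from degrees below the probe Tm.

    Boundary handling is an assumption: each boundary value is assigned to
    the more stringent of the two adjacent classes.
    """
    delta = tm_c - hyb_temp_c  # degrees C below the Tm
    if delta < 0:
        return "above Tm"
    if delta <= 5:
        return "maximum"
    if delta <= 10:
        return "high"
    if delta <= 20:
        return "intermediate"
    if delta <= 25:
        return "low"
    return "below the low-stringency range"


# Hypothetical probe with a Tm of 70 degrees C.
for temp in (65, 62, 55, 47):
    print(temp, stringency(70, temp))
```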

Moderate and high stringency hybridization conditions are well known in the art. 
An example of high stringency conditions includes hybridization at about 42°C in 50% 
formamide, 5X SSC, 5X Denhardt's solution, 0.5% SDS and 100 µg/ml denatured carrier 
DNA followed by washing two times in 2X SSC and 0.5% SDS at room temperature and 
two additional times in 0.1X SSC and 0.5% SDS at 42°C. An example of moderately 
stringent conditions includes an overnight incubation at 37°C in a solution comprising 20% 
formamide, 5 x SSC (150 mM NaCl, 15 mM trisodium citrate), 50 mM sodium phosphate 
(pH 7.6), 5 x Denhardt's solution, 10% dextran sulfate and 20 mg/ml denatured sheared 
salmon sperm DNA, followed by washing the filters in 1x SSC at about 37-50°C. Those 
of skill in the art know how to adjust the temperature, ionic strength, etc. as necessary to 
accommodate factors such as probe length and the like. 

As used herein, "recombinant" includes reference to a cell or vector, that has been 
modified by the introduction of a heterologous nucleic acid sequence or that the cell is 
derived from a cell so modified. Thus, for example, recombinant cells express genes that are 
not found in identical form within the native (non-recombinant) form of the cell or express 
native genes that are otherwise abnormally expressed, under expressed or not expressed at 
all as a result of deliberate human intervention. "Recombination," "recombining," or 
generating a "recombined" nucleic acid is generally the assembly of two or more nucleic acid 
fragments wherein the assembly gives rise to a chimeric gene. 

In a preferred embodiment, mutant DNA sequences are generated with site 
saturation mutagenesis in at least one codon. In another preferred embodiment, site 
saturation mutagenesis is performed for two or more codons. In a further embodiment, 
mutant DNA sequences have more than 40%, more than 45%, more than 50%, more than 
55%, more than 60%, more than 65%, more than 70%, more than 75%, more than 80%, 
more than 85%, more than 90%, more than 95%, or more than 98% homology with the 
wild-type sequence. In alternative embodiments, mutant DNA is generated in vivo using 
any known mutagenic procedure such as, for example, radiation, nitrosoguanidine and the 
like. The desired DNA sequence is then isolated and used in the methods provided 
herein. 

In an alternative embodiment, the transforming DNA sequence comprises 
homology boxes without the presence of an incoming sequence. In this embodiment, it is 
desired to delete the endogenous DNA sequence between the two homology boxes. 
Furthermore, in some embodiments, the transforming sequences are wild-type, while in 
other embodiments, they are mutant or modified sequences. In addition, in some 
embodiments, the transforming sequences are homologous, while in other embodiments, 
they are heterologous. 

As used herein, the term "target sequence" refers to a DNA sequence in the host cell 
that encodes the sequence where it is desired for the incoming sequence to be inserted into 
the host cell genome. In some embodiments, the target sequence encodes a functional wild- 
type gene or operon, while in other embodiments the target sequence encodes a functional 
mutant gene or operon, or a non-functional gene or operon. 

As used herein, a "flanking sequence" refers to any sequence that is either 
upstream or downstream of the sequence being discussed (e.g., for genes A-B-C, gene B 
is flanked by the A and C gene sequences). In a preferred embodiment, the incoming 
sequence is flanked by a homology box on each side. In another embodiment, the 
incoming sequence and the homology boxes comprise a unit that is flanked by stuffer 
sequence on each side. In some embodiments, a flanking sequence is present on only a 
single side (either 3' or 5'), but in preferred embodiments, it is on each side of the 
sequence being flanked. The sequence of each homology box is homologous to a 
sequence in the Bacillus chromosome. These sequences direct where in the Bacillus 
chromosome the new construct gets integrated and what part of the Bacillus chromosome 
will be replaced by the incoming sequence. In a preferred embodiment, the 5' and 3' ends 
of a selective marker are flanked by a polynucleotide sequence comprising a section of 
the inactivating chromosomal segment. In some embodiments, a flanking sequence is 
present on only a single side (either 3' or 5'), while in preferred embodiments, it is present 
on each side of the sequence being flanked. 

As used herein, the term "stuffer sequence" refers to any extra DNA that flanks 
homology boxes (typically vector sequences). However, the term encompasses any non- 
homologous DNA sequence. Not to be limited by any theory, a stuffer sequence provides 
a noncritical target for a cell to initiate DNA uptake. 

As used herein, the term "library of mutants" refers to a population of cells which are 
identical in most of their genome but include different homologues of one or more genes. 
Such libraries find use for example, in methods to identify genes or operons with improved 
traits. 

As used herein, the terms "hypercompetent" and "super competent" mean that 
greater than 1% of a cell population is transformable with chromosomal DNA (e.g., Bacillus 
DNA). Alternatively, the terms are used in reference to cell populations in which greater 
than 10% of a cell population is transformable with a self-replicating plasmid (e.g., a Bacillus 
plasmid). Preferably, the super competent cells are transformed at a rate greater than 
observed for the wild-type or parental cell population. Super competent and hypercompetent 
are used interchangeably herein. 
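
The two thresholds in this definition can be captured in a short check. This is a minimal sketch under the assumption that transformable and total cell counts are available from a plating experiment; the function name and the labels "chromosomal" and "plasmid" are hypothetical.

```python
# Thresholds taken from the definition above: greater than 1% for chromosomal
# DNA, greater than 10% for a self-replicating plasmid.
THRESHOLDS = {"chromosomal": 0.01, "plasmid": 0.10}


def is_hypercompetent(transformants: int, total_cells: int, dna_type: str) -> bool:
    """Return True if the transformable fraction exceeds the stated threshold."""
    if total_cells <= 0:
        raise ValueError("total_cells must be positive")
    if dna_type not in THRESHOLDS:
        raise ValueError("dna_type must be 'chromosomal' or 'plasmid'")
    return transformants / total_cells > THRESHOLDS[dna_type]


# Hypothetical counts: 2 transformants per 100 cells.
print(is_hypercompetent(2, 100, "chromosomal"))  # True
print(is_hypercompetent(2, 100, "plasmid"))      # False
```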

As used herein, the terms "amplification" and "gene amplification" refer to a 
process by which specific DNA sequences are disproportionately replicated such that the 
amplified gene becomes present in a higher copy number than was initially present in the 
genome. In some embodiments, selection of cells by growth in the presence of a drug 
(e.g., an inhibitor of an inhibitable enzyme) results in the amplification of either the 
endogenous gene encoding the gene product required for growth in the presence of the 
drug or by amplification of exogenous (i.e., input) sequences encoding this gene product, 
or both. 

"Amplification" is a special case of nucleic acid replication involving template 
specificity. It is to be contrasted with non-specific template replication (i.e., replication that 
is template-dependent but not dependent on a specific template). Template specificity is 
here distinguished from fidelity of replication (i.e., synthesis of the proper polynucleotide 
sequence) and nucleotide (ribo- or deoxyribo-) specificity. Template specificity is 
frequently described in terms of "target" specificity. Target sequences are "targets" in the 
sense that they are sought to be sorted out from other nucleic acid. Amplification 
techniques have been designed primarily for this sorting out. 

As used herein, the term "co-amplification" refers to the introduction into a single 
cell of an amplifiable marker in conjunction with other gene sequences (i.e., comprising 
one or more non-selectable genes such as those contained within an expression vector) 
and the application of appropriate selective pressure such that the cell amplifies both the 
amplifiable marker and the other, non-selectable gene sequences. The amplifiable marker 
may be physically linked to the other gene sequences or alternatively two separate pieces 
of DNA, one containing the amplifiable marker and the other containing the non-selectable 
marker, may be introduced into the same cell. 

As used herein, the terms "amplifiable marker," "amplifiable gene," and 
"amplification vector" refer to a gene or a vector encoding a gene which permits the 
amplification of that gene under appropriate growth conditions. 

"Template specificity" is achieved in most amplification techniques by the choice of 
enzyme. Amplification enzymes are enzymes that, under the conditions in which they are 
used, will process only specific sequences of nucleic acid in a heterogeneous mixture of 
nucleic acid. For example, in the case of Qβ replicase, MDV-1 RNA is the specific template 
for the replicase (See e.g., Kacian et al., Proc. Natl. Acad. Sci. USA 69:3038 [1972]). Other 
nucleic acids are not replicated by this amplification enzyme. Similarly, in the case of T7 
RNA polymerase, this amplification enzyme has a stringent specificity for its own 
promoters (See, Chamberlin et al., Nature 228:227 [1970]). In the case of T4 DNA ligase, 
the enzyme will not ligate the two oligonucleotides or polynucleotides, where there is a 
mismatch between the oligonucleotide or polynucleotide substrate and the template at the 
ligation junction (See, Wu and Wallace, Genomics 4:560 [1989]). Finally, Taq and Pfu 
polymerases, by virtue of their ability to function at high temperature, are found to display 
high specificity for the sequences bounded and thus defined by the primers; the high 
temperature results in thermodynamic conditions that favor primer hybridization with the 
target sequences and not hybridization with non-target sequences. 

As used herein, the term "amplifiable nucleic acid" refers to nucleic acids which 
may be amplified by any amplification method. It is contemplated that "amplifiable nucleic 
acid" will usually comprise "sample template." 

As used herein, the term "sample template" refers to nucleic acid originating from a 
sample which is analyzed for the presence of "target" (defined below). In contrast, 
"background template" is used in reference to nucleic acid other than sample template 
which may or may not be present in a sample. Background template is most often 
inadvertent. It may be the result of carryover, or it may be due to the presence of nucleic 
acid contaminants sought to be purified away from the sample. For example, nucleic 
acids from organisms other than those to be detected may be present as background in a 
test sample. 

As used herein, the term "primer" refers to an oligonucleotide, whether occurring 
naturally as in a purified restriction digest or produced synthetically, which is capable of 
acting as a point of initiation of synthesis when placed under conditions in which synthesis 
of a primer extension product which is complementary to a nucleic acid strand is induced, 
(i.e., in the presence of nucleotides and an inducing agent such as DNA polymerase and 
at a suitable temperature and pH). The primer is preferably single stranded for maximum 
efficiency in amplification, but may alternatively be double stranded. If double stranded, 
the primer is first treated to separate its strands before being used to prepare extension 
products. Preferably, the primer is an oligodeoxyribonucleotide. The primer must be 
sufficiently long to prime the synthesis of extension products in the presence of the 
inducing agent. The exact lengths of the primers will depend on many factors, including 
temperature, source of primer and the use of the method. 

As used herein, the term "probe" refers to an oligonucleotide (i.e., a sequence of 
nucleotides), whether occurring naturally as in a purified restriction digest or produced 
synthetically, recombinantly or by PCR amplification, which is capable of hybridizing to 
another oligonucleotide of interest. A probe may be single-stranded or double-stranded. 
Probes are useful in the detection, identification and isolation of particular gene 
sequences. It is contemplated that any probe used in the present invention will be labeled 
with any "reporter molecule," so that it is detectable in any detection system, including, but 
not limited to enzyme (e.g., ELISA, as well as enzyme-based histochemical assays), 
fluorescent, radioactive, and luminescent systems. It is not intended that the present 
invention be limited to any particular detection system or label. 

As used herein, the term "target," when used in reference to the polymerase chain 
reaction, refers to the region of nucleic acid bounded by the primers used for polymerase 
chain reaction. Thus, the "target" is sought to be sorted out from other nucleic acid 
sequences. A "segment" is defined as a region of nucleic acid within the target sequence. 

As used herein, the term "polymerase chain reaction" ("PCR") refers to the 
methods of U.S. Patent Nos. 4,683,195, 4,683,202, and 4,965,188, hereby incorporated by 
reference, which include methods for increasing the concentration of a segment of a target 
sequence in a mixture of genomic DNA without cloning or purification. This process for 
amplifying the target sequence consists of introducing a large excess of two 
oligonucleotide primers to the DNA mixture containing the desired target sequence, 
followed by a precise sequence of thermal cycling in the presence of a DNA polymerase. 
The two primers are complementary to their respective strands of the double stranded 
target sequence. To effect amplification, the mixture is denatured and the primers then 
annealed to their complementary sequences within the target molecule. Following 
annealing, the primers are extended with a polymerase so as to form a new pair of 
complementary strands. The steps of denaturation, primer annealing and polymerase 
extension can be repeated many times (i.e., denaturation, annealing and extension 
constitute one "cycle"; there can be numerous "cycles") to obtain a high concentration of 
an amplified segment of the desired target sequence. The length of the amplified segment 
of the desired target sequence is determined by the relative positions of the primers with 
respect to each other, and therefore, this length is a controllable parameter. By virtue of 
the repeating aspect of the process, the method is referred to as the "polymerase chain 
reaction" (hereinafter "PCR"). Because the desired amplified segments of the target 
sequence become the predominant sequences (in terms of concentration) in the mixture, 
they are said to be "PCR amplified". 
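
The dependence of the amplified segment on primer position can be shown with a short sketch. It assumes exact primer matches on a single template strand and idealized doubling in every cycle (an upper bound that real reactions do not reach); the function names and sequences are hypothetical.

```python
def reverse_complement(seq: str) -> str:
    """Return the reverse complement of an uppercase DNA sequence."""
    return seq.translate(str.maketrans("ACGT", "TGCA"))[::-1]


def amplicon(template: str, fwd_primer: str, rev_primer: str):
    """Return the top-strand segment bounded by the two primers, or None.

    The forward primer matches the top strand directly; the reverse primer
    anneals to the top strand, so its site is its reverse complement.
    """
    start = template.find(fwd_primer)
    if start == -1:
        return None
    rev_site = reverse_complement(rev_primer)
    end = template.find(rev_site, start + len(fwd_primer))
    if end == -1:
        return None
    return template[start:end + len(rev_site)]


def ideal_copies(initial_copies: int, cycles: int) -> int:
    """Idealized copy number assuming perfect doubling per cycle."""
    return initial_copies * 2 ** cycles


# Hypothetical template and primers.
template = "TTTTACGGATCCAAAGGGCCCTTTGAATTCGCAAAA"
segment = amplicon(template, "ACGGATCC", "GCGAATTC")
print(segment, len(segment))  # ACGGATCCAAAGGGCCCTTTGAATTCGC 28
print(ideal_copies(1, 30))    # 1073741824
```

Moving either primer site along the template changes the length of the returned segment, which is the sense in which the amplicon length is a controllable parameter.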

As used herein, the term "amplification reagents" refers to those reagents 
(deoxyribonucleotide triphosphates, buffer, etc.), needed for amplification except for 
primers, nucleic acid template and the amplification enzyme. Typically, amplification 
reagents along with other reaction components are placed and contained in a reaction 
vessel (test tube, microwell, etc.). 

With PCR, it is possible to amplify a single copy of a specific target sequence in 
genomic DNA to a level detectable by several different methodologies (e.g., hybridization 
with a labeled probe; incorporation of biotinylated primers followed by avidin-enzyme 
conjugate detection; incorporation of ³²P-labeled deoxynucleotide triphosphates, such as 
dCTP or dATP, into the amplified segment). In addition to genomic DNA, any 
oligonucleotide or polynucleotide sequence can be amplified with the appropriate set of 
primer molecules. In particular, the amplified segments created by the PCR process itself 
are, themselves, efficient templates for subsequent PCR amplifications. 

As used herein, the terms "PCR product," "PCR fragment," and "amplification 
product" refer to the resultant mixture of compounds after two or more cycles of the PCR 
steps of denaturation, annealing and extension are complete. These terms encompass 
the case where there has been amplification of one or more segments of one or more 
target sequences. 

As used herein, the term "RT-PCR" refers to the replication and amplification of 
RNA sequences. In this method, reverse transcription is coupled to PCR, most often 
using a one enzyme procedure in which a thermostable polymerase is employed, as 
described in U.S. Patent No. 5,322,770, herein incorporated by reference. In RT-PCR, the 
RNA template is converted to cDNA due to the reverse transcriptase activity of the 
polymerase, and then amplified using the polymerizing activity of the polymerase (i.e., as 
in other PCR methods). 

As used herein, the terms "restriction endonucleases" and "restriction enzymes" 
refer to bacterial enzymes, each of which cuts double-stranded DNA at or near a specific 
nucleotide sequence. 




A "restriction site" refers to a nucleotide sequence recognized and cleaved by a 
given restriction endonuclease and is frequently the site for insertion of DNA fragments. In 
certain embodiments of the invention restriction sites are engineered into the selective 
marker and into 5' and 3' ends of the DNA construct. 
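
Locating restriction sites in a construct amounts to a sequence search. The sketch below is illustrative only: it uses the EcoRI recognition sequence GAATTC as an example, searches a single strand, and reports 0-based start positions; the function name and the DNA string are hypothetical.

```python
def find_sites(dna: str, recognition_site: str) -> list:
    """Return 0-based start positions of every occurrence of a recognition site."""
    positions = []
    i = dna.find(recognition_site)
    while i != -1:
        positions.append(i)
        # Continue one base further so overlapping sites are also reported.
        i = dna.find(recognition_site, i + 1)
    return positions


# Hypothetical construct carrying two EcoRI sites (GAATTC).
construct = "AAGAATTCGGGAATTCTT"
print(find_sites(construct, "GAATTC"))  # [2, 10]
```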

As used herein, "an inactivating chromosomal segment" comprises two sections. 
Each section comprises polynucleotides that are homologous with the upstream or 
downstream genomic chromosomal DNA that immediately flanks an indigenous 
chromosomal region as defined herein. "Immediately flanks" means the nucleotides 
comprising the inactivating chromosomal segment do not include the nucleotides defining 
the indigenous chromosomal region. The inactivating chromosomal segment directs 
where in the Bacillus chromosome the DNA construct gets integrated and what part of the 
Bacillus chromosome will be replaced. 

As used herein, "indigenous chromosomal region" and "a fragment of an 
indigenous chromosomal region" refer to a segment of the Bacillus chromosome which is 
deleted from a Bacillus host cell in some embodiments of the present invention. In 
general, the terms "segment," "region," "section," and "element" are used interchangeably 
herein. In some embodiments, deleted segments include one or more genes with known 
functions, while in other embodiments, deleted segments include one or more genes with 
unknown functions, and in other embodiments, the deleted segments include a 
combination of genes with known and unknown functions. In some embodiments, 
indigenous chromosomal regions or fragments thereof include as many as 200 genes or 
more. 

In some embodiments, an indigenous chromosomal region or fragment thereof has 
a necessary function under certain conditions, but the region is not necessary for Bacillus 
strain viability under laboratory conditions. Preferred laboratory conditions include but are 
not limited to conditions such as growth in a fermenter, in a shake flask, on plated media, 
etc., at standard temperatures and atmospheric conditions (e.g., aerobic). 

An indigenous chromosomal region or fragment thereof may encompass a range 
of about 0.5 kb to 500 kb; about 1.0 kb to 500 kb; about 5 kb to 500 kb; about 10 kb to 
500 kb; about 10 kb to 200 kb; about 10 kb to 100 kb; about 10 kb to 50 kb; about 100 kb to 
500 kb; and about 200 kb to 500 kb of the Bacillus chromosome. In another aspect, when 
an indigenous chromosomal region or fragment thereof has been deleted, the 
chromosome of the altered Bacillus strain may include 99%, 98%, 97%, 96%, 95%, 94%, 
93%, 92%, 91%, 90%, 85%, 80%, 75% or 70% of the corresponding unaltered Bacillus 
host chromosome. Preferably, the chromosome of an altered Bacillus strain according to 
the invention will include about 99 to 90%; 99 to 92%; and 98 to 94% of the corresponding 
unaltered Bacillus host strain chromosome. 
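
By way of illustration only, the relationship between the size of a deleted region and the percentage of the host chromosome that remains can be sketched as follows. The sketch assumes the 4215 kb chromosome size given herein for B. subtilis 168; the function name `percent_retained` is hypothetical and is introduced only for this example.

```python
# Illustrative arithmetic only. CHROMOSOME_KB is the 4215 kb chromosome size
# stated in this specification for B. subtilis 168; percent_retained is a
# hypothetical helper name, not part of the described methods.
CHROMOSOME_KB = 4215.0


def percent_retained(deleted_kb, chromosome_kb=CHROMOSOME_KB):
    """Return the percentage of the host chromosome left after a deletion."""
    if not 0 <= deleted_kb <= chromosome_kb:
        raise ValueError("deleted_kb must lie between 0 and the chromosome size")
    return 100.0 * (chromosome_kb - deleted_kb) / chromosome_kb


if __name__ == "__main__":
    # Region sizes taken from the ranges recited above (in kb).
    for deleted in (0.5, 10, 100, 200, 500):
        print(f"{deleted:>6} kb deleted -> {percent_retained(deleted):.2f}% retained")
```

Under this assumption, a single deletion at the upper end of the recited range (500 kb) leaves roughly 88% of the chromosome, so retained fractions below that would correspond to more than one deleted region.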

As used herein, "strain viability" refers to reproductive viability. The deletion of an 
indigenous chromosomal region or fragment thereof does not deleteriously affect division 
and survival of the altered Bacillus strain under laboratory conditions. 

As used herein, "altered Bacillus strain" refers to a genetically engineered Bacillus 
sp. wherein a protein of interest has an enhanced level of expression and/or production as 
compared to the expression and/or production of the same protein of interest in a 
corresponding unaltered Bacillus host strain grown under essentially the same growth 
conditions. In some embodiments, the enhanced level of expression results from the 
inactivation of one or more chromosomal genes. In one embodiment, the enhanced level 
of expression results from the deletion of one or more chromosomal genes. In some 
embodiments, the altered Bacillus strains are genetically engineered Bacillus sp. having 
one or more deleted indigenous chromosomal regions or fragments thereof, wherein a 
protein of interest has an enhanced level of expression or production, as compared to a 
corresponding unaltered Bacillus host strain grown under essentially the same growth 
conditions. In an alternative embodiment, the enhanced level of expression results from 
the insertional inactivation of one or more chromosomal genes. In some alternate 
embodiments, the enhanced level of expression results from increased activation or an 
otherwise optimized gene. In some preferred embodiments, the inactivated genes are 
selected from the group consisting of sbo, slr, ybcO, csn, spoIISA, phrC, sigB, rapA, CssS, 
trpA, trpB, trpC, trpD, trpE, trpF, tdh/kbl, alsD, sigD, prpC, gapB, pckA, fbp, rocA, ycgN, 
ycgM, rocF, and rocD. 

In certain embodiments, the altered Bacillus strain comprises two inactivated 
genes, while in other embodiments, there are three inactivated genes, four inactivated 
genes, five inactivated genes, six inactivated genes, or more. Thus, it is not intended that 
the number of inactivated genes be limited to any particular number of genes. In some 
embodiments, the inactivated genes are contiguous to one another, while in other 
embodiments, they are located in separate regions of the Bacillus chromosome. In some 
embodiments, an inactivated chromosomal gene has a necessary function under certain 
conditions, but the gene is not necessary for Bacillus strain viability under laboratory 
conditions. Preferred laboratory conditions include but are not limited to conditions such 
as growth in a fermenter, in a shake flask, on plated media, etc., suitable for the growth of the 
microorganism. 

As used herein, a "corresponding unaltered Bacillus strain" is the host strain (e.g., 
the originating and/or wild-type strain) from which the indigenous chromosomal region or 
fragment thereof is deleted or modified and from which the altered strain is derived. 

As used herein, the term "chromosomal integration" refers to the process whereby 
the incoming sequence is introduced into the chromosome of a host cell (e.g., Bacillus). 
The homologous regions of the transforming DNA align with homologous regions of the 
chromosome. Subsequently, the sequence between the homology boxes is replaced by 
the incoming sequence in a double crossover (i.e., homologous recombination). In some 
embodiments of the present invention, homologous sections of an inactivating 
chromosomal segment of a DNA construct align with the flanking homologous regions of 
the indigenous chromosomal region of the Bacillus chromosome. Subsequently, the 
indigenous chromosomal region is deleted by the DNA construct in a double crossover 
(i.e., homologous recombination). 

"Homologous recombination" means the exchange of DNA fragments between two 
DNA molecules or paired chromosomes at the site of identical or nearly identical 
nucleotide sequences. In a preferred embodiment, chromosomal integration is 
homologous recombination. 
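
The replacement of the sequence lying between two homology boxes can be pictured with a simple string model. The following is a minimal sketch under stated assumptions: the chromosome is treated as a linear, single-stranded string, each homology box matches exactly and occurs once, and all sequences and the function name `double_crossover` are invented for illustration. It does not model the biology of recombination itself.

```python
def double_crossover(chromosome, upstream_box, downstream_box, incoming):
    """Replace the stretch between two homology boxes with `incoming`.

    Toy model: exact string matching stands in for the alignment of
    homologous regions; an empty `incoming` models a clean deletion.
    """
    start = chromosome.find(upstream_box)
    if start == -1:
        raise ValueError("upstream homology box not found")
    left_end = start + len(upstream_box)
    right_start = chromosome.find(downstream_box, left_end)
    if right_start == -1:
        raise ValueError("downstream homology box not found after upstream box")
    # Both homology boxes are retained; only the region between them changes.
    return chromosome[:left_end] + incoming + chromosome[right_start:]


if __name__ == "__main__":
    # Invented toy sequences: flank + box + region to replace + box + flank.
    toy_chromosome = "GGGG" + "AACCGGTT" + "ACGTACGTACGT" + "TTGGCCAA" + "CCCC"
    marker = "ATGCAT"  # stands in for an incoming sequence such as a marker
    print(double_crossover(toy_chromosome, "AACCGGTT", "TTGGCCAA", marker))
    print(double_crossover(toy_chromosome, "AACCGGTT", "TTGGCCAA", ""))
```

In this model the two homology boxes play the role of the two sections of an inactivating chromosomal segment: they fix where the incoming sequence lands and which stretch of the chromosome is replaced.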

"Homologous sequences" as used herein means a nucleic acid or polypeptide 
sequence having 100%, 99%, 98%, 97%, 96%, 95%, 94%, 93%, 92%, 91%, 90%, 88%, 
85%, 80%, 75%, or 70% sequence identity to another nucleic acid or polypeptide 
sequence when optimally aligned for comparison. In some embodiments, homologous 
sequences have between 85% and 100% sequence identity, while in other embodiments 
there is between 90% and 100% sequence identity, and in more preferred embodiments, 
there is between 95% and 100% sequence identity. 
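
As a hedged illustration of how a percent sequence identity figure might be computed once two sequences have already been optimally aligned, consider the sketch below. It assumes the alignment is supplied as two equal-length strings with "-" marking gaps, and it counts identical positions over all aligned columns. Other denominators are in common use, so this is one convention among several, not a definition taken from this specification. The names `percent_identity` and `is_homologous` are hypothetical.

```python
def percent_identity(aligned_a, aligned_b, gap="-"):
    """Percent identity of two pre-aligned sequences of equal length.

    Assumption: identity = identical columns / all columns in which at
    least one sequence has a residue. Gap-versus-residue columns count
    as mismatches.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("sequences must be aligned to the same length")
    columns = [
        (a, b)
        for a, b in zip(aligned_a.upper(), aligned_b.upper())
        if not (a == gap and b == gap)
    ]
    if not columns:
        raise ValueError("no aligned positions to compare")
    matches = sum(1 for a, b in columns if a == b)
    return 100.0 * matches / len(columns)


def is_homologous(aligned_a, aligned_b, threshold=85.0):
    """True if the aligned pair meets the chosen identity threshold."""
    return percent_identity(aligned_a, aligned_b) >= threshold


if __name__ == "__main__":
    # Two invented 20-nucleotide sequences differing at two positions.
    seq_a = "ATGAAAAAAGCTGTCATTGT"
    seq_b = "ATGAAGAAAGCTGTCATCGT"
    print(percent_identity(seq_a, seq_b))
    print(is_homologous(seq_a, seq_b, threshold=85.0))
    print(is_homologous(seq_a, seq_b, threshold=95.0))
```

The alignment step itself is outside the scope of this sketch; in practice it would be produced by an alignment program before any identity figure is computed.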

As used herein "amino acid" refers to peptide or protein sequences or portions 
thereof. The terms "protein", "peptide" and "polypeptide" are used interchangeably. 

As used herein, "protein of interest" and "polypeptide of interest" refer to a 
protein/polypeptide that is desired and/or being assessed. In some embodiments, the 
protein of interest is intracellular, while in other embodiments, it is a secreted polypeptide. 
Particularly preferred polypeptides include enzymes, including, but not limited to those 
selected from amylolytic enzymes, proteolytic enzymes, cellulolytic enzymes, 
oxidoreductase enzymes and plant cell-wall degrading enzymes. More particularly, these 
enzymes include, but are not limited to amylases, proteases, xylanases, lipases, laccases, 
phenol oxidases, oxidases, cutinases, cellulases, hemicellulases, esterases, peroxidases, 
catalases, glucose oxidases, phytases, pectinases, glucosidases, isomerases, 
transferases, galactosidases and chitinases. In some particularly preferred embodiments 
of the present invention, the polypeptide of interest is a protease. In some embodiments, 
the protein of interest is a secreted polypeptide which is fused to a signal peptide (i.e., an 
amino-terminal extension on a protein to be secreted). Nearly all secreted proteins use an 
amino-terminal protein extension which plays a crucial role in the targeting to and 
translocation of precursor proteins across the membrane. This extension is proteolytically 
removed by a signal peptidase during or immediately following membrane transfer. 

In some embodiments of the present invention, the polypeptide of interest is 
selected from hormones, antibodies, growth factors, receptors, etc. Hormones 
encompassed by the present invention include but are not limited to, follicle-stimulating 
hormone, luteinizing hormone, corticotropin-releasing factor, somatostatin, gonadotropin 
hormone, vasopressin, oxytocin, erythropoietin, insulin and the like. Growth factors 
include, but are not limited to platelet-derived growth factor, insulin-like growth factors, 
epidermal growth factor, nerve growth factor, fibroblast growth factor, transforming growth 
factors, cytokines, such as interleukins (e.g., IL-1 through IL-13), interferons, colony 
stimulating factors, and the like. Antibodies include but are not limited to immunoglobulins 
obtained directly from any species from which it is desirable to produce antibodies. In 
addition, the present invention encompasses modified antibodies. Polyclonal and 
monoclonal antibodies are also encompassed by the present invention. In particularly 
preferred embodiments, the antibodies are human antibodies. 

As used herein, the term "heterologous protein" refers to a protein or polypeptide that 
does not naturally occur in the host cell. Examples of heterologous proteins include enzymes 
such as hydrolases including proteases, cellulases, amylases, carbohydrases, and lipases; 
isomerases such as racemases, epimerases, tautomerases, or mutases; transferases, 
kinases and phosphatases. In some embodiments, the proteins are therapeutically 
significant proteins or peptides, including but not limited to growth factors, cytokines, ligands, 
receptors and inhibitors, as well as vaccines and antibodies. In additional embodiments, the 
proteins are commercially important industrial proteins/peptides (e.g., proteases, 
carbohydrases such as amylases and glucoamylases, cellulases, oxidases and lipases). In 
some embodiments, the genes encoding the proteins are naturally occurring genes, while in 
other embodiments, mutated and/or synthetic genes are used. 

As used herein, "homologous protein" refers to a protein or polypeptide native or 
naturally occurring in a cell. In preferred embodiments, the cell is a Gram-positive cell, 
while in particularly preferred embodiments, the cell is a Bacillus host cell. In alternative 
embodiments, the homologous protein is a native protein produced by other organisms, 
including but not limited to E. coli. The invention encompasses host cells producing the 
homologous protein via recombinant DNA technology. 

As used herein, an "operon region" comprises a group of contiguous genes that 
are transcribed as a single transcription unit from a common promoter, and are thereby 
subject to co-regulation. In some embodiments, the operon includes a regulator gene. In 
most preferred embodiments, operons that are highly expressed as measured by RNA 
levels, but have an unknown or unnecessary function are used. 

As used herein, a "multi-contiguous single gene region" is a region wherein at least 
the coding regions of two genes occur in tandem and in some embodiments, include 
intervening sequences preceding and following the coding regions. In some 
embodiments, an antimicrobial region is included. 

As used herein, an "antimicrobial region" is a region containing at least one gene 
that encodes an antimicrobial protein. 

DETAILED DESCRIPTION OF THE INVENTION 

The present invention provides cells that have been genetically manipulated to 
have an altered capacity to produce expressed proteins. In particular, the present 
invention relates to Gram-positive microorganisms, such as Bacillus species having 
enhanced expression of a protein of interest, wherein one or more chromosomal genes 
have been inactivated, and preferably wherein one or more chromosomal genes have 
been deleted from the Bacillus chromosome. In some further embodiments, one or more 
indigenous chromosomal regions have been deleted from a corresponding wild-type 
Bacillus host chromosome. Indeed, the present invention provides means for deletion of 
single or multiple genes, as well as large chromosomal deletions. In preferred 
embodiments, such deletions provide advantages such as improved production of a 
protein of interest. 

A. Gene Deletions 

As indicated above, the present invention includes embodiments that involve single 
or multiple gene deletions and/or mutations, as well as large chromosomal deletions. 

In some preferred embodiments, the present invention includes a DNA construct 
comprising an incoming sequence. The DNA construct is assembled in vitro, followed by 
direct cloning of the construct into a competent Bacillus host, such that the DNA construct 
becomes integrated into the Bacillus chromosome. For example, PCR fusion and/or 
ligation can be employed to assemble a DNA construct in vitro. In some embodiments, 
the DNA construct is a non-plasmid construct, while in other embodiments it is 
incorporated into a vector (e.g., a plasmid). In some embodiments, circular plasmids are 
used. In preferred embodiments, circular plasmids are designed to use an appropriate 
restriction enzyme (i.e., one that does not disrupt the DNA construct). Thus, linear 
plasmids find use in the present invention (See, Figure 1). However, other methods are 
suitable for use in the present invention, as known to those in the art (See e.g., Perego, 
"Integrational Vectors for Genetic Manipulation in Bacillus subtilis," in Sonenshein et al. 
(eds.), Bacillus subtilis and Other Gram-Positive Bacteria, American Society for 
Microbiology, Washington, DC [1993]). 

In some embodiments, the incoming sequence includes a selective marker. In 
some preferred embodiments, the incoming sequence includes a chromosomal gene 
selected from the group consisting of sbo, slr, ybcO, csn, spoIISA, phrC, sigB, rapA, CssS, 
trpA, trpB, trpC, trpD, trpE, trpF, tdh/kbl, alsD, sigD, prpC, gapB, pckA, fbp, rocA, ycgN, 
ycgM, rocF, and rocD or fragments of any of these genes (alone or in combination). In 
additional embodiments, the incoming sequence includes a homologous sbo, slr, ybcO, 
csn, spoIISA, phrC, sigB, rapA, CssS, trpA, trpB, trpC, trpD, trpE, trpF, tdh/kbl, alsD, sigD, 
prpC, gapB, pckA, fbp, rocA, ycgN, ycgM, rocF, and/or rocD gene sequence. A 
homologous sequence is a nucleic acid sequence having at least 99%, 98%, 97%, 96%, 
95%, 94%, 93%, 92%, 91%, 90%, 88%, 85% or 80% sequence identity to a sbo, slr, ybcO, 
csn, spoIISA, phrC, sigB, rapA, CssS, trpA, trpB, trpC, trpD, trpE, trpF, tdh/kbl, alsD, sigD, 
prpC, gapB, pckA, fbp, rocA, ycgN, ycgM, rocF, and rocD gene or gene fragment thereof, 
which may be included in the incoming sequence. In preferred embodiments, the 
incoming sequence comprising a homologous sequence comprises at least 95% 
sequence identity to a sbo, slr, ybcO, csn, spoIISA, phrC, sigB, rapA, CssS, trpA, trpB, 
trpC, trpD, trpE, trpF, tdh/kbl, alsD, sigD, prpC, gapB, pckA, fbp, rocA, ycgN, ycgM, rocF, 
or rocD gene or gene fragment of any of these genes. In yet other embodiments, the 
incoming sequence comprises a selective marker flanked on the 5' and 3' ends with a 
fragment of the gene sequence. In some embodiments, when the DNA construct 
comprising the selective marker and gene, gene fragment or homologous sequence 
thereto is transformed into a host cell, the location of the selective marker renders the 
gene non-functional for its intended purpose. In some embodiments, the incoming 
sequence comprises the selective marker located in the promoter region of the gene. In 
other embodiments, the incoming sequence comprises the selective marker located after 
the promoter region of the gene. In yet other embodiments, the incoming sequence 
comprises the selective marker located in the coding region of the gene. In further 
embodiments, the incoming sequence comprises a selective marker flanked by a 
homology box on both ends. In still further embodiments, the incoming sequence includes 
a sequence that interrupts the transcription and/or translation of the coding sequence. In 
yet additional embodiments, the DNA construct includes restriction sites engineered at the 
upstream and downstream ends of the construct. 

Whether the DNA construct is incorporated into a vector or used without the 
presence of plasmid DNA, it is used to transform microorganisms. It is contemplated that 
any suitable method for transformation will find use with the present invention. In 
preferred embodiments, at least one copy of the DNA construct is integrated into the host 
Bacillus chromosome. In some embodiments, one or more DNA constructs of the 
invention are used to transform host cells. For example, one DNA construct may be used 
to inactivate a slr gene and another construct may be used to inactivate a phrC gene. Of 
course, additional combinations are contemplated and provided by the present invention. 

In some preferred embodiments, the DNA construct also includes a polynucleotide 
encoding a protein of interest. In some of these preferred embodiments, the DNA 
construct also includes a constitutive or inducible promoter that is operably linked to the 
sequence encoding the protein of interest. In some preferred embodiments in which the 
protein of interest is a protease, the promoter is selected from the group consisting of a tac 
promoter, a β-lactamase promoter, or an aprE promoter (DeBoer et al., Proc. Natl. Acad. 
Sci. USA 80:21-25 [1983]). However, it is not intended that the present invention be 
limited to any particular promoter, as any suitable promoter known to those in the art finds 
use with the present invention. Nonetheless, in particularly preferred embodiments, the 
promoter is the B. subtilis aprE promoter. 

Various methods are known for the transformation of Bacillus species. Indeed, 
methods for altering the chromosome of Bacillus involving plasmid constructs and 
transformation of the plasmids into E. coli are well known. In most methods, plasmids are 
subsequently isolated from E. coli and transformed into Bacillus. However, it is not 
essential to use intervening microorganisms such as E. coli, and in some preferred 
embodiments, the DNA construct is directly transformed into a competent Bacillus host. 

In some embodiments, the well-known Bacillus subtilis strain 168 finds use in the 
present invention. Indeed, the genome of this strain has been well-characterized (See, 
Kunst et al., Nature 390:249-256 [1997]; and Henner et al., Microbiol. Rev., 44:57-82 
[1980]). The genome is comprised of one 4215 kb chromosome. While the coordinates 
used herein refer to the 168 strain, the invention encompasses analogous sequences from 
Bacillus strains. 

In some embodiments, the incoming chromosomal sequence includes one or more 
genes selected from the group consisting of sbo, slr, ybcO, csn, spoIISA, sigB, phrC, rapA, 
CssS, trpA, trpB, trpC, trpD, trpE, trpF, tdh/kbl, alsD, sigD, prpC, gapB, pckA, fbp, rocA, 
ycgN, ycgM, rocF, and rocD, gene fragments thereof and homologous sequences thereto. 
The DNA coding sequences of these genes from B. subtilis 168 are provided in SEQ ID NO: 
1, SEQ ID NO: 3, SEQ ID NO: 5, SEQ ID NO: 7, SEQ ID NO: 9, SEQ ID NO: 11, SEQ ID 
NO: 13, SEQ ID NO: 15, SEQ ID NO:17, SEQ ID NO:39, SEQ ID NO:40, SEQ ID NO:42, 
SEQ ID NO:44, SEQ ID NO:46, SEQ ID NO:48, SEQ ID NO:50, SEQ ID NO:37, SEQ ID 
NO:25, SEQ ID NO:21, SEQ ID NO:50, SEQ ID NO:29, SEQ ID NO:23, SEQ ID NO:27, SEQ 
ID NO:19, SEQ ID NO:31, SEQ ID NO:48, SEQ ID NO:46, SEQ ID NO:35, and SEQ ID 
NO:33. 

As mentioned above, in some embodiments, the incoming sequence which 
comprises a sbo, slr, ybcO, csn, spoIISA, sigB, phrC, rapA, CssS, trpA, trpB, trpC, trpD, 
trpE, trpF, tdh, kbl, alsD, sigD, prpC, gapB, pckA, fbp, rocA, ycgN, ycgM, rocF, and rocD 
gene, a gene fragment thereof, or a homologous sequence thereto includes the coding 
region and may further include immediate chromosomal coding region flanking 
sequences. In some embodiments the coding region flanking sequences include a range 
of about 1 bp to 2500 bp; about 1 bp to 1500 bp, about 1 bp to 1000 bp, about 1 bp to 500 
bp, and 1 bp to 250 bp. The number of nucleic acid sequences comprising the coding 
region flanking sequence may be different on each end of the gene coding sequence. 
For example, in some embodiments, the 5' end of the coding sequence includes less 
than 25 bp and the 3' end of the coding sequence includes more than 100 bp. 
Sequences of these genes and gene products are provided below. The numbering used 
herein is that used in SubtiList (See e.g., Moszer et al., Microbiol., 141:261-268 [1995]). 
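
For illustration, selecting a coding region together with flanking sequences of different lengths on each end can be sketched as a coordinate calculation. The sketch assumes 1-based inclusive coordinates, a gene on the forward strand, and a chromosome held as a linear string (no wrap-around at the origin). The function name `region_with_flanks`, the toy chromosome, and the coordinates used are hypothetical.

```python
def region_with_flanks(chromosome, cds_start, cds_end, flank5=0, flank3=0):
    """Return (start, end, sequence) for a coding region plus flanks.

    Coordinates are 1-based and inclusive. Flanks are clipped at the
    ends of the (linear) sequence rather than wrapped around.
    """
    if not 1 <= cds_start <= cds_end <= len(chromosome):
        raise ValueError("coding coordinates fall outside the sequence")
    if flank5 < 0 or flank3 < 0:
        raise ValueError("flank lengths must be non-negative")
    start = max(1, cds_start - flank5)
    end = min(len(chromosome), cds_end + flank3)
    return start, end, chromosome[start - 1:end]


if __name__ == "__main__":
    toy_chromosome = "ACGT" * 250  # invented 1000-nt stand-in sequence
    # Asymmetric flanks: fewer than 25 bp on the 5' end, more than 100 bp
    # on the 3' end, as in the example given above.
    start, end, seq = region_with_flanks(
        toy_chromosome, 301, 420, flank5=20, flank3=120
    )
    print(start, end, len(seq))
```

A gene on the reverse strand would need the 5' and 3' flanks swapped and the result reverse-complemented; that case is deliberately left out of this sketch.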

The sbo coding sequence of B. subtilis 168 is shown below: 

ATGAAAAAAGCTGTCATTGTAGAAAACAAAGGTTGTGCAACATGCTCGATCGGAGCCG 
CTTGTCTAGTGGACGGTCCTATCCCTGATTTTGAAATTGCCGGTGCAACAGGTCTATTC 
GGTCTATGGGGG (SEQ ID NO:1). 

The deduced amino acid sequence for Sbo is: 
MKKAVIVENKGCATCSIGAACLVDGPIPDFEIAGATGLFGLWG (SEQ ID NO: 2). 
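
The correspondence between a coding sequence and its deduced amino acid sequence can be checked mechanically. The sketch below translates the sbo coding sequence given above using the standard genetic code and compares the result with the deduced Sbo sequence. The function name `translate` and the table construction are illustrative choices, not part of this specification.

```python
# Standard genetic code, built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds):
    """Translate a coding sequence codon by codon, stopping at a stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("coding sequence length must be a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        residue = CODON_TABLE[cds[i:i + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# sbo coding sequence and deduced Sbo sequence as listed above.
SBO_CDS = (
    "ATGAAAAAAGCTGTCATTGTAGAAAACAAAGGTTGTGCAACATGCTCGATCGGAGCCG"
    "CTTGTCTAGTGGACGGTCCTATCCCTGATTTTGAAATTGCCGGTGCAACAGGTCTATTC"
    "GGTCTATGGGGG"
)
SBO_PROTEIN = "MKKAVIVENKGCATCSIGAACLVDGPIPDFEIAGATGLFGLWG"

if __name__ == "__main__":
    print(translate(SBO_CDS) == SBO_PROTEIN)
```

This sketch reads every codon with the standard table, so a coding sequence that begins with an alternative start codon such as TTG would show leucine rather than the initiating methionine at position 1 and would need that first residue handled separately.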

In one embodiment, the gene region found at about 3834868 to 3835219 bp of the 
B. subtilis 168 chromosome was deleted using the present invention. The sbo coding 
region found at about 3835081 to 3835209 produces subtilosin A, an antimicrobial that has 
activity against some Gram-positive bacteria (See, Zheng et al., J. Bacteriol., 
181:7346-7355 [1994]). 
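
As a simple consistency check on the coordinates given above, the lengths of the deleted region and of the sbo coding region can be computed directly. The sketch assumes the coordinates are 1-based and inclusive; the helper name `span_length` is hypothetical.

```python
def span_length(start, end):
    """Length in bp of a region given 1-based inclusive coordinates."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


# Coordinates as stated above for the B. subtilis 168 chromosome.
DELETED_REGION = (3834868, 3835219)
SBO_CODING = (3835081, 3835209)

if __name__ == "__main__":
    print(span_length(*DELETED_REGION))      # 352 bp deleted
    print(span_length(*SBO_CODING))          # 129 bp of coding sequence
    print(span_length(*SBO_CODING) // 3)     # 43 codons
    # The coding region lies entirely within the deleted region.
    print(DELETED_REGION[0] <= SBO_CODING[0] and SBO_CODING[1] <= DELETED_REGION[1])
```

Under that assumption the 129 bp coding region corresponds to 43 codons, which agrees with the 43 residues of the deduced Sbo sequence, and the deleted region extends 213 bp upstream and 10 bp downstream of the coding region.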



The slr coding sequence of B. subtilis 168 is shown below: 

ATGATTGGAAGAATTATCCGTTTGTACCGTAAAAGAAAAGGCTATTCTATTAATCAGCTG 

GCTGTTGAGTCAGGCGTATCCAAATCCTATTTAAGCAAGATTGAAAGAGGCGTTCACAC 

GAATCCGTCCGTTCAATTTTTAAAAAAAGTTTCTGCCACACTGGAAGTTGAATTAACAGA 

ATTATTTGACGCAGAAACAATGATGTATGAAAAAATCAGCGGCGGTGAAGAAGAATGGC 

GCGTACATTTAGTGCAAGCCGTACAAGCCGGGATGGAAAAGGAAGAATTGTTCACTTTT 

ACGAACAGACTCAAGAAAGAACAGCCTGAAACTGCCTCTTACCGCAACCGCAAACTGA 

CGGAATCCAATATAGAAGAATGGAAAGCGCTGATGGCGGAGGCAAGAGAAATCGGCTT 

GTCTGTCCATGAAGTCAAATCCTTTTTAAAAACAAAGGGAAGA (SEQ ID NO:3). 

The deduced amino acid sequence for Slr is: 

MIGRIIRLYRKRKGYSINQLAVESGVSKSYLSKIERGVHTNPSVQFLKKVSATLEVELTELF 
DAETMMYEKISGGEEEWRVHLVQAVQAGMEKEELFTFTNRLKKEQPETASYRNRKLTES 
NIEEWKALMAEAREIGLSVHEVKSFLKTKGR (SEQ ID NO: 4). 



In one embodiment, the sequence found at about 3529014 - 3529803 bp of the B. 
subtilis 168 chromosome was deleted using the present invention. The slr coding 
sequence is found at about 3529131 to 3529586 of the chromosome. 



The phrC coding sequence of B. subtilis 168 is provided below: 

ATGAAATTGAAATCTAAGTTGTTTGTTATTTGTTTGGCCGCAGCCGCGATTTTTACAGCG 
GCTGGGGTTTCTGCTAATGCGGAAGCACTCGACTTTCATGTGACAGAAAGAGGAATGA 
GG (SEQ ID NO:13). 

The deduced amino acid sequence for PhrC is: 
MKLKSKLFVICLAAAAIFTAAGVSANAEALDFHVTERGMT (SEQ ID NO: 14) 



Additionally, the coding region found at about 429531 to 429650 bp of the B. 
subtilis 168 chromosome was inactivated by an insertion of a selective marker at 429591 
of the coding sequence. 
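
To illustrate where such an insertion falls within the coding sequence, the offset and codon position can be derived from the stated coordinates. The sketch assumes 1-based inclusive coordinates, a forward-strand gene, and that the marker is inserted immediately before the base at the stated coordinate. These are assumptions made for the example, and the function name `insertion_context` is hypothetical.

```python
def insertion_context(cds_start, cds_end, insert_pos):
    """Describe an insertion point relative to a coding region.

    Returns the coding length, the number of coding bases upstream of
    the insertion point, the 1-based codon containing that point, and
    the phase (0 means the insertion falls on a codon boundary).
    """
    if not cds_start <= insert_pos <= cds_end:
        raise ValueError("insertion point lies outside the coding region")
    offset = insert_pos - cds_start
    return {
        "cds_length": cds_end - cds_start + 1,
        "upstream_bases": offset,
        "codon_number": offset // 3 + 1,
        "phase": offset % 3,
    }


if __name__ == "__main__":
    # phrC coordinates as stated above.
    print(insertion_context(429531, 429650, 429591))
```

With these assumptions the 120 bp coding region (40 codons) is interrupted after its first 60 bases, that is, at the boundary between codons 20 and 21, roughly in the middle of the coding sequence.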

The sigB coding sequence of B. subtilis 168 is shown below: 

TTGATCATGACACAACCATCAAAAACTACGAAACTAACTAAAGATGAAGTCGATCGGCT 

CATAAGCGATTACCAAACAAAGCAAGATGAAGAAGCGCAGGAAACGCTTGTGCGGGTG 

TATACAAATCTGGTTGACATGCTTGCGAAAAAATACTCAAAAGGCAAAAGCTTCCACGA 

GGATCTCCGCCAGGTCGGCATGATCGGGCTGCTAGGCGCGATTAAGCGATACGATCC 

TGTTGTCGGCAAATCGTTTGAAGCTTTTGCAATCCCGACAATCATCGGTGAAATTAAAC 

GTTTCCTCAGAGATAAAACATGGAGCGTTCATGTGCCGAGACGAATTAAAGAACTCGGT 

CCAAGAATCAAAATGGCGGTTGATCAGCTGACCAGTGAAACACAAAGATCGCCGAAAG 

TCGAAGAGATTGCCGAATTCCTCGATGTTTCTGAAGAAGAGGTTCTTGAAACGATGGAA 

ATGGGCAAAAGCTATCAAGCCTTATCCGTTGACCACAGCATTGAAGCGGATTCGGACG 

GAAGCACTGTCACGATTCTTGATATCGTCGGATCACAGGAGGACGGATATGAGCGGGT 
CAACCAGCAATTGATGCTGCAAAGCGTGCTTCATGTCCTTTCAGACCGTGAGAAACAAA 
TCATAGACCTTACGTATATTCAAAACAAAAGCCAAAAAGAAACTGGGGACATTCTCGGT 
ATATCTCAAATGCACGTCTCGCGCTTGCAACGCAAAGCTGTGAAGAAGCTCAGAGAGG 
CCTTGATTGAAGATCCCTCGATGGAGTTAATG (SEQ ID NO:9). 

The deduced amino acid sequence for SigB is: 

MIMTQPSKTTKLTKDEVDRLISDYQTKQDEQAQETLVRVYTNLVDMLAKKYSKGKSFHED 
LRQVGMIGLLGAIKRYDPVVGKSFEAFAIPTIIGEIKRFLRDKTWSVHVPRRIKELGPRIKMA 
VDQLTTETQRSPKVEEIAEFLDVSEEEVLETMEMGKSYQALSVDHSIEADSDGSTVTILDI 
VGSQEDGYERVNQQLMLQSVLHVLSDREKQIIDLTYIQNKSQKETGDILGISQMHVSRLQ 
RKAVKKLREALIEDPSMELM (SEQ ID NO: 10). 



Additionally, the coding sequence is found at about 522417 to 5232085 bp of the 
B. subtilis 168 chromosome. 

The spoIISA coding sequence of B. subtilis 168 is shown below: 

ATGGTTTTATTCTTTCAGATCATGGTCTGGTGCATCGTGGCCGGACTGGGGTTATACGT 

GTATGCCACGTGGCGTTTCGAAGCGAAGGTCAAAGAAAAAATGTCCGCCATTCGGAAA 

ACTTGGTATTTGCTGTTTGTTCTGGGCGCTATGGTATACTGGACATATGAGCCCACTTC 

CCTATTTACCCACTGGGAACGGTATCTCATTGTCGCAGTCAGTTTTGCTTTGATTGATG 

CTTTTATCTTCTTAAGTGCATATGTCAAAAAACTGGCCGGCAGCGAGCTTGAAACAGAC 

ACAAGAGAAATTCTTGAAGAAAACAACGAAATGCTCCACATGTATCTCAATCGGCTGAA 

AACATACCAATACCTATTGAAAAACGAACCGATCCATGTTTATTATGGAAGTATAGATGC 

TTATGCTGAAGGTATTGATAAGCTGCTGAAAACCTATGCTGATAAAATGAACTTAACGG 

CTTCTCTTTGCCACTATTCGACACAGGCTGATAAAGACCGGTTAACCGAGCATATGGAT 

GATCCGGCAGATGTACAAACACGGCTCGATCGAAAGGATGTTTATTACGACCAATACG 

GAAAAGTGGTTCTCATCCCTTTTACCATCGAGACACAGAACTATGTCATCAAGCTGACG 

TCTGACAGCATTGTCACGGAATTTGATTATTTGCTATTTACGTCATTAACGAGCATATAT 

GATTTGGTGCTGCCAATTGAGGAGGAAGGTGAAGGA (SEQ ID NO:11). 

The deduced amino acid sequence for SpoIISA is: 

MVLFFQIMVWCIVAGLGLYVYATWRFEAKVKEKMSAIRKTWYLLFVLGAMVYWTYEPTSL 
FTHWERYLIVAVSFALIDAFIFLSAYVKKLAGSELETDTREILEENNEMLHMYLNRLKTYQY 
LLKNEPIHVYYGSIDAYAEGIDKLLKTYADKMNLTASLCHYSTQADKDRLTEHMDDPADV 
QTRLDRKDVYYDQYGKVVLIPFTIETQNYVIKLTSDSIVTEFDYLLFTSLTSIYDLVLPIEEEG 
EG (SEQ ID NO: 12). 

Additionally, the coding region is found at about 1347587 to 1348714 bp of the B. 
subtilis 168 chromosome. 



The csn coding sequence of B. subtilis 168 is shown below: 

ATGAAAATCAGTATGCAAAAAGCAGATTTTTGGAAAAAAGCAGCGATCTCATTACTTGTT 

TTCACCATGTTTTTTACCCTGATGATGAGCGAAACGGTTTTTGCGGCGGGACTGAATAA 

AGATCAAAAGCGCCGGGCGGAACAGCTGACAAGTATCTTTGAAAACGGCACAACGGA 

GATCCAATATGGATATGTAGAGCGATTGGATGACGGGCGAGGCTATACATGCGGACGG 

GCAGGCTTTACAACGGCTACCGGGGATGCATTGGAAGTAGTGGAAGTATACACAAAGG 

CAGTTCCGAATAACAAACTGAAAAAGTATCTGCCTGAATTGCGCCGTCTGGCCAAGGA 

AGAAAGCGATGATACAAGCAATCTCAAGGGATTCGCTTCTGCCTGGAAGTCGCTTGCA 

AATGATAAGGAATTTCGCGCCGCTCAAGACAAAGTAAATGACCATTTGTATTATCAGCC 
TGCCATGAAACGATCGGATAATGCCGGACTAAAAACAGCATTGGCAAGAGCTGTGATG 

TACGATACGGTTATTCAGCATGGCGATGGTGATGACCCTGACTCTTTTTATGCCTTGAT 

TAAACGTACGAACAAAAAAGCGGGCGGATCACCTAAAGACGGAATAGACGAGAAGAAG 

TGGTTGAATAAATTCTTGGACGTACGCTATGACGATCTGATGAATCCGGCCAATCATGA 

CACCCGTGACGAATGGAGAGAATCAGTTGCCCGTGTGGACGTGCTTCGCTGTATCGCC 

AAGGAGAACAAGTATAATCTAAACGGACCGATTCATGTTCGTTCAAAGGAGTACGGTAA 

TTTTGTAATCAAA (SEQ ID NO:7). 



The deduced amino acid sequence for Csn is: 

MKISMQKADFWKKAAISLLVFTMFFTLMMSETVFAAGLNKDQKRRAEQLTSIFENGTTEIQ 
YGYVERLDDGRGYTCGRAGFTTATGDALEVVEVYTKAVPNNKLKKYLPELRRLAKEESD 
DTSNLKGFASAWKSLANDKEFRAAQDKVNDHLYYQPAMKRSDNAGLKTALARAVMYDT 
VIQHGDGDDPDSFYALIKRTNKKAGGSPKDGIDEKKWLNKFLDVRYDDLMNPANHDTRD 
EWRESVARVDVLRSIAKENNYNLNGPIHVRSNEYGNFVIK (SEQ ID NO: 8). 

Additionally, the coding region is found at about 2747213 to 2748043 bp of the B. 
subtilis 168 chromosome. 

The ybcO coding sequence of B. subtilis 168 is shown below: 

ATGAAAAGAAACCAAAAAGAATGGGAATCTGTGAGTAAAAAAGGACTTATGAAGCCGG 
GAGGTACTTCGATTGTGAAAGCTGCTGGCTGCATGGGCTGTTGGGCCTCGAAGAGTAT 
TGCTATGACACGTGTTTGTGCAGTTCGGCATCCTGCTATGAGAGCTATT (SEQ ID NO:5). 



The deduced amino acid sequence for YbcO is: 

MKRNQKEWESVSKKGLMKPGGTSIVKAAGCMGCWASKSIAMTRVCALPHPAMRAI 
(SEQ ID NO: 6). 

Additionally, the coding region is found at about 213926 to 214090 bp of the B. 
subtilis 168 chromosome. 

The rapA coding sequence of B. subtilis 168 is shown below: 

TTGAGGATGAAGCAGACGATTCCGTCCTCTTATGTCGGGCTTAAAATTAATGAATGGTA 

TACTCATATCCGGCAGTTCCACGTCGCTGAAGCCGAACGGGTCAAGCTCGAAGTAGAA 

AGAGAAATTGAGGATATGGAAGAAGACCAAGATTTGCTGCTGTATTATTCTTTAATGGA 

GTTCAGGCACCGTGTCATGGTGGATTACATTAAGCCTTTTGGAGAGGACACGTCGCAG 

CTAGAGTTTTCAGAATTGTTAGAAGACATCGAAGGGAATCAGTACAAGCTGACAGGGCT 

TCTCGAATATTACTTTAATTTTTTTCGAGGAATGTATGAATTTAAGCAGAAGATGTTTGTC 

AGTGCCATGATGTATTATAAACGGGCAGAAAAGAATCTTGCCCTCGTCTCGGATGATAT 

TGAGAAAGCAGAGTTTGCTTTTAAAATGGCTGAGATTTTTTACAATTTAAAACAAACCTA 

TGTTTCGATGAGCTACGCCGTTCAGGCATTAGAAACATACCAAATGTATGAAACGTACA 

CCGTCCGCAGAATCCAATGTGAATTCGTTATTGCAGGTAATTATGATGATATGCAGTAT 

CCAGAAAGAGCATTGCCCCACTTAGAACTGGCTTTAGATCTTGCAAAGAAAGAAGGCA 

ATCCCCGCCTGATCAGTTCTGCCCTATATAATCTCGGAAACTGCTATGAGAAAATGGGT 

GAACTGGAAAAGGCAGCCGAATACTTTGGGAAATCTGTTTCTATTTGCAAGTCGGAAAA 

GTTCGATAATCTTCCGCATTCTATCTACTCTTTAACACAAGTTCTGTATAAACAAAAAAAT 

GACGCCGAAGCGCAAAAAAAGTATCGTGAAGGATTGGAAATCGCCCGTCAATACAGTG 

ATGAATTATTTGTGGAGCTTTTTCAATTTTTACATGCGTTATACGGAAAAAACATTGACA 

CAGAATCAGTCTCACAGACCTTTCAATTTCTTGAAGAACATATGCTGTATCCTTATATTG 

AAGAGCTGGCGCATGATGCTGCCCAATTGTATATAGAAAACGGACAGCCCGAAAAAGC 
ACTTTCATTTTATGAGAAAATGGTGCACGCACAAAAACAAATCCAGAGAGGAGATTGTT 
TATATGAAATC (SEQ ID NO:15). 

The deduced amino acid sequence for RapA is: 

MRMKQTIPSSYVGLKINEWYTHIRQFHVAEAERVKLEVEREIEDMEEDQDLLLYYSLMEF 
RHRVMLDYIKPFGEDTSQLEFSELLEDIEGNQYKLTGLLEYYFNFFRGMYEFKQKMFVSA 
MMYYKRAEKNLALVSDDIEKAEFAFKMAEIFYNLKQTYVSMSYAVQALETYQMYETYTVR 
RIQCEFVIAGNYDDMQYPERALPHLELALDLAKKEGNPRLISSALYNLGNCYEKMGELQK 
AAEYFGKSVSICKSEKFDNLPHSIYSLTQVLYKQKNDAEAQKKYREGLEIARQYSDELFVE 
LFQFLHALYGKNIDTESVSHTFQFLEEHMLYPYIEELAHDAAQFYIENGQPEKALSFYEKM 
VHAQKQIQRGDCLYEI (SEQ ID NO: 16). 



Additionally, the coding region is found at about 1315179 to 1316312 bp of the B. 
subtilis 168 chromosome. 

The CssS coding sequence of B. subtilis 168 is shown below: 

ATGAAAAACAAGCCGCTCGCGTTTCAGATATGGGTTGTCATATCCGGCATCCTGTTAG 

CGATATCGATTTTACTGCTTGTGTTATTTTCAAACACGCTGCGAGATTTTTTCACTAAT 

GAAACGTATACGACGATTGAAAATGAGCAGCATGTTCTGACAGAGTACCGCCTGCCA 

GGTTCGATTGAAAGGCGCTATTACAGCGAGGAAGCGACGGCGCCGACAACTGTCCG 

CTCCGTACAGCACGTGCTCCTTCCTGAAAATGAAGAGGCTTCTTCAGACAAGGATTTA 

AGCATTCTGTCATCTTCATTTATCCACAAGGTGTACAAGCTGGCTGATAAGCAGGAAG 

CTAAAAAGAAACGTTACAGCGCCGACGTCAATGGAGAGAAAGTGTTTTTTGTCATTAA 

AAAGGGACTTTCCGTCAATGGACAATCAGCGATGATGCTCTCTTACGCGCTTGATTCT 

TATCGGGACGATTTGGCCTATACCTTGTTCAAACAGCTTCTGTTTATTATAGCTGTCGT 

CATTTTATTAAGCTGGATTCCGGCTATTTGGCTTGCAAAGTATTTATCAAGGCCTCTTG 

TATCATTTGAAAAACACGTCAAACGGATTTCTGAACAGGATTGGGATGACCCAGTAAA 

AGTGGACCGGAAAGATGAAATCGGCAAATTGGGCCATACCATCGAAGAGATGCGCC 

AAAAGCTTGTGCAAAAGGATGAAACAGAAAGAACTCTATTGCAAAATATCTCTCATGA 

TTTAAAAACGCCGGTCATGGTCATCAGAGGCTATACACAATCAATTAAAGACGGGATT 

TTTCCTAAAGGAGACCTTGAAAACACTGTAGATGTTATTGAATGCGAAGCTCTTAAGC 

TGGAGAAAAAAATAAAGGATTTATTATATTTAACGAAGGTGGATTATTTAGCGAAGCAA 

AAAGTGCAGCACGACATGTTCAGTATTGTGGAAGTGACAGAAGAAGTCATCGAACGA 

TTGAAGTGGGCGCGGAAAGAACTATCGTGGGAAATTGATGTAGAAGAGGATATTTTG 

ATGCCGGGCGATCCGGAGCAATGGAACAAACTCCTCGAAAACATTTTGGAAAATCAA 

ATCCGCTATGCTGAGACAAAAATAGAAATCAGCATGAAACAAGATGATCGAAATATCG 

TGATCACCATTAAAAATGACGGTCCGCATATTGAAGATGAGATGCTCTCCAGCCTCTA 

TGAGCCTTTTAATAAAGGGAAGAAAGGCGAATTCGGCATTGGTCTAAGCATCGTAAAA 

CGAATTTTAACTCTTCATAAGGCATCTATCTCAATTGAAAATGACAAAACGGGTGTATC 

ATACCGCATAGCAGTGCCAAAA (SEQ ID NO:17). 



The deduced amino acid sequence for CssS (GenBank Accession No. 032193) is: 

MKNKPLAFQIWVVISGILLAISILLLVLFSNTLRDFFTNETYTTIENEQHVLTEYRLPGSIE 
RRYYSEEATAPTTVRSVQHVLLPENEEASSDKDLSILSSSFIHKVYKLADKQEAKKKR 
YSADVNGEKVFFVIKKGLSVNGQSAMMLSYALDSYRDDLAYTLFKQLLFIIAVVILLSWIPAI 
WLAKYLSRPLVSFEKHVKRISEQDWDDPVKVDRKDEIGKLGHTIEEMRQKLVQKDETER 
TLLQNISHDLKTPVMVIRGYTQSIKDGIFPKGDLENTVDVIECEALKLEKKIKDLLYLTKLDY 
LAKQKVQHDMFSIVEVTEEVIERLKWARKELSWEIVEEDILMPGDPEQWNKLLENILENQI 
RYAETKIEISMKQDDRNIVITIKNDGPHIEDEMLSSLYEPFNKGKKGEFGIGLSIVKRILTLHK 
ASISIENDKTGVSYRIAVPK (SEQ ID NO:18). 

Additionally, the gene region is found at about 3384612 to 3386774 bp of the B. 
subtilis 168 chromosome. 

The fbp coding sequence of the Fbp protein (fructose-1,6-bisphosphatase) of B. 
subtilis 168 is shown below: 

ATGTTTAAAAATAATGTCATACTTTTAAATTCACCTTATCATGCACATGCTCATAAAGA 

GGGGTTTATTCTAAAAAGGGGATGGACGGTTTTGGAAAGCAAGTACCTAGATCTACT 

CGCACAAAAATACGATTGTGAAGAAAAAGTGGTAACAGAAATCATCAATTTGAAAGCG 

ATATTGAACCTGCCAAAAGGCACCGAGGATTTTGTCAGTGATCTGCACGGAGAGTAT 

CAGGCATTCCAGCACGTGTTGCGCAATGGTTCAGGACGAGTCAAAGAGAAGATACG 

CGACATCTTCAGCGGTGTCATTTACGATAGAGAAATTGATGAATTAGCAGCATTGGTC 

TATTATCCGGAAGACAAACTGAAATTAATCAAACATGACTTTGATGCGAAAGAAGCGT 

TAAACGAGTGGTATAAAGAAACGATTCATCGAATGATTAAGCTCGTTTCATATTGCTC 

CTCTAAGTATACCCGCTCCAAATTACGCAAAGCACTGCCTGCCCAATTTGCTTATATT 

ACGGAGGAGCTGTTATACAAAACAGAACAAGCTGGCAACAAGGAGCAATATTACTCC 

GAAATCATTGATCAGATCATTGAACTTGGCCAAGCCGATAAGCTGATCACCGGCCTT 

GCTTACAGCGTTCAGCGATTGGTGGTCGACCATCTGCATGTGGTCGGCGATATTTAT 

GACCGCGGCCCGCAGCCGGATAGAATTATGGAAGAACTGATCAACTATCATTGTGTC 

GATATTCAGTGGGGAAATCACGATGTCCTTTGGATCGGCGCCTATTCCGGTTCCAAA 

GTGTGCCTGGCCAATATTATCCGCATCTGTGCCCGCTACGACAACCTGGATATTATTG 

AGGACGTGTACGGCATCAACCTGAGACCGCTGCTGAACCTGGCCGAAAAATATTATG 

ATGATAATCCAGCGTTCCGTGCAAAAGCAGACGAAAACAGG 

CCAGAGGATGAGATTAAGCAAATCACAAAAATCCATCAAGCGATTGCCATGATCCAAT 

TCAAGCTTGAGAGCCCGATTATCAAGAGACGGCCGAACTTTAATATGGAAGAGCGGC 

TGTTATTAGAGAAAATAGACTATGACAAAAATGAAATCACGCTGAACGGAAAAACATA 

TCAACTGGAAAACACCTGCTTTGCGACGATTAATCCGGAGCAGCCAGATCAGCTATT 

AGAAGAAGAAGCAGAAGTCATAGACAAGCTGCTATTCTCTGTCCAGCATTCCGAAAA 

GCTGGGCCGCCATATGAATTTTATGATGAAAAAAGGCAGCCTTTATTTAAAATATAAC 

GGCAACCTGTTGATTCACGGCTGTATTCCAGTTGATGAAAACGGCAATATGGAAACG 

ATGATGATTGAGGATAAACCGTATGCGGGCCGTGAGCTGCTCGATGTATTTGAACGA 

TTCTTGCGGGAAGCCTTTGCCCACCCGGAAGAAACCGATGACCTGGCGACAGATATG 

GCTTGGTATTTATGGACAGGCGAATACTCCTCCCTCTTCGGAAAACGCGCGATGACG 

ACATTTGAGGGGTATTTCATGAAAGAGAAGGAAACGCATAAAGAGAAGAAAAAGGCGT 

ATTATTATTTACGAGAAGAGGAGGCAACGTGCCGAAACATGCTGGCAGAATTCGGCG 

TCAATCCAGATGACGGGCATATCATCAAGGGCGATACACGTGTAAAAGAAATGGAAG 

GAGAAGAGGGAATGAAAGGAAACGGAAAAATGATCGTGATGGACGGCGGCTTGTGCA 

AAGCGTACCAATCCACAACAGGCATCGCCGGCTAGACGGTGCTATACAACTGGTACG 

GCATGGAGCTCGTCGCCCATAAACAGTTCAATTCCAAGGCAGAAGTCCTAAGCACCG 

GAACCGACGTCTTAACGGTCAAACGATTAGTGGACAAAGAGCTTGAGCGGAAGAAAG 

TGAAGGAAAGGAATGTGGGTGAGGAATTGTTGGAGGAAGTTGCGATTTTAGAGAGTT 

TGCGGGAGTATCGGTATATGAAG (SEQ ID NO:19). 



The deduced amino acid sequence of the Fbp protein is: 

MFKNNVILLNSPYHAHAHKEGFILKRGWTVLESKYLDLLAQKYDCEEKVVTEIINLKAILNL 

PKGTEHFVSDLHGEYQAFQHVLRNGSGRVKEKIRDIFSGVIYDREIDELAALVYYPED 

KLKLIKHDFDAKEALNEWYKETIHRMIKLVSYCSSKYTRSKLRKALPAQFAYITEELLYK 

TEQAGNKEQYYSEIIDQIIELGQADKLITGLAYSVQRLVVDHLHVVGDIYDRGPQPDRIM 

EELINYHSVDIQWGNHDVLWIGAYSGSKVCLANIIRIGARYDNLDIIEDVYGINLRPLLN 

LAEKYYDDNPAFRPKADENRPEDEIKQITKIHQAIAMIQFKLESPIIKRRPNFNMEERLL 
LEKIDYDKNEITLNGKTYQLENTCFATINPEQPDQLLEEEAEVIDKLLFSVQHSEKLGRH 

MNFMMKKGSLYLKYNGNLLIHGCIPVDENGNMETMMIEDKPYAGRELLDVFERFLREAF 

AHPEETDDLATDMAWYLWTGEYSSLFGKRAMTTFERYFIKEKETHKEKKNPYYYLREDE 

ATCRNILAEFGLNPDHGHIINGHTPVKEiEGEDPIKANGKMIVIDGGFSKAYQSTTGIAGYT 

LLYNSYGMQLVAHKHFNSKAEVLSTGTDVLTVKRLVDKELERKKVKETNVGEELLQEVAI 

LESLREYRYMK (SEQ ID NO:20). 

Additionally, the coding region is found at about 4127053 to 4129065 bp of the B. 
subtilis 168 chromosome. 



The alsD coding sequence of the alsD protein (alpha-acetolactate 
decarboxylase) of B. subtilis 168 is shown below: 



ATGAAACGAGAAAGCAACATTCAAGTGCTCAGCCGTGGTCAAAAAGATCAGCCTGTG 

AGCCAGATTTATCAAGTATCAACAATGACTTCTCTATTAGACGGAGTATATGACGGAG 

ATTTTGAACTGTCAGAGATTCCGAAATATGGAGACTTCGGTATCGGAACCTTTAACAA 

GCTTGACGGAGAGCTGATTGGGTTTGACGGCGAATTTTACCGTCTTCGCTCAGACGG 

AACCGCGACACCGGTCCAAAATGGAGACCGTTCACCGTTCTGTTCATTTACGTTCTTT 

ACACCGGACATGACGCACAAAATTGATGCGAAAATGACACGCGAAGACTTTGAAAAA 

GAGATCAACAGCATGCTGCCAAGCAGAAACTTATTTTATGCAATTCGCATTGACGGAT 

TGTTTAAAAAGGTGCAGACAAGAACAGTAGAACTTCAAGAAAAACCTTACGTGCCAAT 

GGTTGAAGCGGTCAAAACACAGCCGATTTTCAACTTCGACAACGTGAGAGGAACGAT 

TGTAGGTTTCTTGACACCAGCTTATGCAAACGGAATCGCGGTTTCTGGGTATCACCTG 

CACTTCATTGACGAAGGACGCAATTCAGGCGGACACGTTTTTGAGTATGTGCTTGAG 

GATTGCACGGTTACGATTTCTCAAAAAATGAACATGAATCTCAGACTTCCGAACACAG 

CGGATTTCTTTAATGCGAATCTGGATAACCCTGATTTTGCGAAAGATATCGAAACAAC 

TGAAGGAAGCCCTGAA (SEQ ID NO:21). 

The deduced amino acid sequence of the AlsD protein is: 

MKRESNIQVLSRGQKDQPVSQIYQVSTMTSLLDGVYDGDFELSEIPKYGDFGIGTFNKLD 
GELIGFDGEFYRLRSDGTATPVQNGDRSPFCSFTFFTPDMTHKIDAKMTREDFEKEINSM 
LPSRNLFYAIRIDGLFKKVQTRTVELQEKPYVPMVEAVKTQPIFNFDNVRGTIVGFLTPAYA 
NGIAVSGYHLHFIDEGRNSGGHVFDYVLEDCTVTISQKMNMNLRLPNTADFFNANLDNPD 
FAKDIETTEGSPE (SEQ ID NO:22). 



Additionally, the coding region is found at about 3707829-3708593 bp of the B. 
subtilis 168 chromosome. 
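
As a quick consistency check on coordinates such as those quoted above, the length of a coding region follows directly from its first and last base-pair positions. The sketch below is illustrative only and is not part of the original disclosure; the helper name `region_length` is hypothetical, and it assumes the quoted positions are 1-based and inclusive.

```python
def region_length(first_bp: int, last_bp: int) -> int:
    """Length in bp of a region given inclusive 1-based end coordinates.

    Works for either orientation (first_bp may be larger than last_bp
    when a gene lies on the reverse strand).
    """
    return abs(last_bp - first_bp) + 1


# alsD coordinates as quoted in the text (assumed 1-based, inclusive).
alsd_length = region_length(3707829, 3708593)
alsd_codons, alsd_remainder = divmod(alsd_length, 3)
print(alsd_length, alsd_codons, alsd_remainder)
```

Under these assumptions the alsD region spans 765 bp, which is a whole number of codons (255).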



The gapB coding sequence of the gapB protein (glyceraldehyde-3-phosphate 
dehydrogenase) of B. subtilis 168 is shown below: 

ATGAAGGTAAAAGTAGCGATCAACGGGTTTGGAAGAATCGGAAGAATGGTTTTTAGA 

AAAGCGATGTTAGACGATCAAATTCAAGTAGTGGCCATTAACGCCAGCTATTCCGCA 

GAAACGCTGGCTCATTTAATAAAGTATGACACAATTCACGGCAGATACGACAAAGAG 

GTTGTGGCTGGTGAAGATAGCCTGATCGTAAATGGAAAGAAAGTGCTTTTGTTAAACA 

GCCGTGATCCAAAACAGCTGCCTTGGCGGGAATATGATATTGACATAGTCGTCGAAG 

CAACAGGGAAGTTTAATGCTAAAGATAAAGCGATGGGCCATATAGAAGCAGGTGCAA 
AAAAAGTGATTTTGACCGCTCCGGGAAAAAATGAAGACGTTACCATTGTGATGGGCG 

TAAATGAGGACCAATTCGACGCTGAGCGCCATGTCATTATTTCAAATGCGTCATGCAC 

GACAAATTGCCTTGCGCCTGTTGTAAAAGTGCTGGATGAAGAGTTTGGCATTGAGAG 

CGGTCTGATGACTACAGTTCATGCGTATACGAATGACCAAAAAAATATTGATAACCCG 

CACAAAGATTTGCGCCGGGCGCGGGCTTGCGGTGAATCCATCATTCCAACAACAACA 

GGAGCGGCAAAGGCGCTTTCGCTTGTGCTGCCGCATGTGAAAGGAAAACTTCACGG 

CCTCGCCTTGCGTGTGCCTGTTCCGAACGTCTCATTGGTTGATCTCGTTGTTGATCTG 

AAAACGGATGTTACGGCTGAAGAAGTAAACGAGGCATTTAAACGCGCTGCCAAAACG 

TCGATGTACGGTGTACTTGATTACTCAGATGAAGCGCTG 

GTTTCGACTGATTATAATAGGAATCCGCATTCAGCGGTCATTGACGGGCTTACAACAA 
TGGTAATGGAAGACAGGAAAGTAAAGGTGCTGGCGTGGTATGACAACGAATGGGGC 
TACTCCTGCAGAGTTGTTGATCTAATCCGCCATGTAGCGGCACGAATGAAACATCCG 
TCTGCTGTA (SEQ ID NO:23). 

The deduced amino acid sequence of the GapB protein is: 

MKVKVAINGFGRIGRMVFRKAMLDDQIQVVAINASYSAETLAHLIKYDTIHGRYDKEVVA 

GEDSLIVNGKKVLLLNSRDPKQLPWREYDIDIVVEATGKFNAKDKAMGHIEAGAKKVILT 

APGKNEDVTIVMGVNEDQFDAERHVIISNASCTTNCLAPVVKVLDEEFGIESGLMTTVHAY 

TNDQKNIDNPHKDLRRARACGESIIPTTTGAAKALSLVLPHLKGKLHGLALRVPVPNVSLV 

DLVVDLKTDVTAEEVNEAFKRAAKTSMYGVLDYSDEPLVSTDYNTNPHSAVIDGLTTMVM 

EDRKVKVLAWYDNEWGYSCRVVDLIRHVAARMKHPSAV (SEQ ID NO:24). 



Additionally, the coding region is found at about 2966075-2967094 bp of the B. 
subtilis 168 chromosome. 

The Kbl coding sequence of the Kbl protein (2-amino-3-ketobutyrate CoA ligase) is 
shown below: 



ATGACGAAGGAATTTGAGTTTTTAAAAGCAGAGCTTAATAGTATGAAAGAAAACCATA 

CATGGCAAGACATAAAACAGCTTGAATCTATGCAGGGCCCATCTGTCACAGTGAATC 

ACCAAAAAGTCATTCAGCTATCTTCTAATAATTACCTCGGATTCACTTCACATCCTAGA 

CTCATCAACGCCGCACAGGAGGCCGTTCAGCAGTATGGAGCCGGCACCGGATCAGT 

GAGAACGATTGCGGGTACATTTACAATGCATCAAGAGCTTGAGAAAAAGCTGGCAGC 

CTTTAAAAAAACGGAGGCGGCACTTGTATTCCAATGAGGGTTCACAACAAACCAAGG 

CGTACTTTCAAGTATTCTATCAAAAGAGGACATTGTCATGTCAGATGAATTGAACCAT 

GCCTCTATTATTGACGGAATTCGACTGACAAAGGCGGATAAAAAGGTGTATCAGCAC 

GTCAATATGAGTGATTTAGAGCGGGTGCTGAGAAAGTCAATGAATTATCGGATGCGT 

CTGATTGTGACAGAGGGCGTATTTTCCATGGATGGCAACATAGCTCCTCTGCCTGATA 

TTGTAGAGCTCGCTGAGAAATATGACGCATTTGTGATGGTGGATGAGGCCCATGCAT 

GCGGAGTACTTGGCGAAAACGGCAGGGGAACGGTGAATCACTTGGGTCTTGAGGGC 

AGAGTGCATATTCAGGTCGGAACATTAAGCAAGGGAATCGGAGTGCTCGGCGGCTA 

GGCTGCAGGTTCAAAGGTGCTGATGGATTATTTGCGCCATAAAGGCCGTCCATTTTTA 

TTGAGCACATCTCATCCGCCGGCAGTCAGTGGAGCTTGTATGGAAGCGATTGATGTC 

TTGCTTGAAGAGCCGGAGCATATGGAGCGCTTGTGGGAGAATAGTGCCTATTTTAAA 

GCAATGGTTGTGAAAATGGGTGTGACTCTGACGAAGAGTGAAACGCGGATTCTTCCT 

ATTTTAATAGGTGATGAAGGTGTGGCAAAGCAATTTTCAGATCAGCTCCTTTCTCGCG 

GTGTTTTTGCCCAAAGTATCGTTTTGGCGACTGTAGCAAAGGGAAAAGCCAGAATTGG 

GAGGATTATAAGAGCAGAGCACAGGAAAGATGAACTGGATGAGGCGCTTGATGTCAT 

CGAAAAGACGGGAAAGGAGGTGCAGCTATTG (SEQ ID NO:25). 
The deduced amino acid sequence of the Kbl protein is: 

MTKEFEFLKAELNSMKENHTWQDIKQLESMQGPSVTVNHQKVIQLSSNNYLGFTSHPRLI 

NAAQEAVQQYGAGTGSVRTIAGTFTMHQELEKKLAAFKKTEAALVFQSGFTTNQGVLSSI 

LSKEDIVISDELNHASIIDGIRLTKADKKVYQHVNMSDLERVLRKSMNYRMRLIVTDGVFS 

MDGNIAPLPDIVELAEKYDAFVMVDDAHASGVLGENGRGTVNHFGLDGRVHIQVGTLSK 

AIGVLGGYAAGSKVLIDYLRHKGRPFLFSTSHPPAVTAACMEAIDVLLEEPEHMERLWEN 

TAYFKAMLVKMGLTLTKSETPILPILIGDEGVAKQFSDQLLSRGVFAQSIVFPTVAKGKARI 

RTIITAEHTKDELDQALDVIEKTAKELQLL (SEQ ID NO:26). 

Additionally, the coding region is found at about 1770787 - 1771962 bp of the B. 
subtilis 168 chromosome. 

The PckA coding sequence of the PckA protein (phosphoenolpyruvate carboxykinase) of 
B. subtilis 168 is shown below: 

ATGAACTCAGTTGATTTGACCGCTGATTTACAAGCCTTATTAACATGTCCAAATGTGC 

GTCATAATTTATCAGCAGCACAGCTAACAGAAAAAGTCCTCTCCCGAAACGAAGGCAT 

TTTAACATCCACAGGTGCTGTTCGCGCGACAACAGGCGCTTACACAGGACGCTCACC 

TAAAGATAAATTCATCGTGGAGGAAGAAAGCACGAAAAATAAGATCGATTGGGGCCC 

GGTGAATCAGCCGATTTCAGAAGAAGCGTTTGAGCGGCTGTACACGAAAGTTGTCAG 

CTATTTAAAGGAGCGAGATGAACTGTTTGTTTTCGAAGGATTTGCCGGAGCAGACGA 

GAAATACAGGCTGCCGATCACTGTCGTAAATGAGTTCGCATGGCACAATTTATTTGCG 

CGGCAGCTGTTTATCCGTCCGGAAGGAAATGATAAGAAAACAGTTGAGCAGCCGTTC 

ACCATTCTTTCTGCTCCGCATTTCAAAGCGGATCCAAAAACAGACGGCACTCATTCCG 

AAACGTTTATTATTGTCTCTTTCGAAAAGCGGACAATTTTAATCGGCGGAACTGAGTA 

TGCCGGTGAAATGAAGAAGTCCATTTTCTCCATTATGAATTTCCTGCTGCCTGAAAGA 

GATATTTTATCTATGCACTGCTCCGCCAATGTCGGTGAAAAAGGCGATGTCGCCCTTT 

TCTTCGGACTGTCAGGAACAGGAAAGACCACCCTGTCGGCAGATGCTGACCGCAAG 

CTGATCGGTGACGATGAACATGGCTGGTCTGATACAGGCGTCTTTAATATTGAAGGC 

GGATGCTACGCTAAGTGTATTCATTTAAGCGAGGAAAAGGAGCCGCAAATCTTTAAC 

GCGATCCGCTTCGGGTCTGTTCTCGAAAATGTCGTTGTGGATGAAGATACACGCGAA 

GCCAATTATGATGATTGCTTCTATACTGAAAACACGCGGGCAGCTTACCCGATTCATA 

TGATTAATAACATCGTGACTCCAAGCATGGCCGGCCATCCGTCAGCCATTGTATTTTT 

GACGGCTGATGCCTTCGGAGTCCTGCCGCCGATCAGCAAAGTAACGAAGGAGCAGG 

TGATGTACCATTTTTTGAGCGGTTACACGAGTAAGCTTGCCGGAACCGAACGTGGTG 

TCACGTCTCCTGAAACGACGTTTTCTACATGCTTCGGCTCACCGTTCGTGCCGCTTCC 

TGCTCACGTCTATGCTGAAATGCTCGGCAAAAAGATCGATGAACACGGCGCAGACGT 

TTTCTTAGTCAATACCGGATGGACCGGGGGCGGCTACGGCACAGGCGAACGAATGA 

AGCTTTCTTACACTAGAGCAATGGTCAAAGCAGCGATTGAAGGCAAATTAGAGGATG 

CTGAAATGATAACTGACGATATTTTCGGCGTGCACATTCCGGCCCATGTTCCTGGCGT 

TCCTGATCATATCCTTCAGCCTGAAAACACGTGGACCAACAAGGAAGAATACAAAGAA 

AAAGCAGTCTACCTTGCAAATGAATTCAAAGAGAACTTTAAAAAGTTCGCACATACCG 

ATGCCATCGCCCAGGCAGGCGGCCCTCTCGTA(SEQ ID NO:27). 



The deduced amino acid sequence of the PckA protein is: 

MNSVDLTADLQALLTCPNVRHNLSAAQLTEKVLSRNEGILTSTGAVRATTGAYTGRSPKD 

KFIVEEESTKNKIDWGPVNQPISEEAFERLYTKVVSYLKERDELFVFEGFAGADEKYRLPI 

TVVNEFAWHNLFARQLFIRPEGNDKKTVEQPFTILSAPHFKADPKTDGTHSETFIIVSF 

EKRTILIGGTEYAGEMKKSIFSIMNFLLPERDILSMHCSANVGEKGDVALFFGLSGTGKT 

TLSADADRKLIGDDEHGWSDTGVFNIEGGCYAKCIHLSEEKEPQIFNAIRFGSVLENVVV 
DEDTREANYDDSFYTENTRAAYPIHMINNIVTPSMAGHPSAIVFLTADAFGVLPPISKLT 
KEQVMYHFLSGYTSKLAGTERGVTSPETTFSTCFGSPFLPLPAHVYAEMLGKKIDEHGAD 
VFLVNTGWTGGGYGTGERMKLSYTRAMVKAAIEGKLEDAEMITDDIFGLHIPAHVPGVPD 
HILQPENTWTNKEEYKEKAVYLANEFKENFKKFAHTDAIAQAGGPLV (SEQ ID NO:28). 

Additionally, the coding region is found at about 3128579-3130159 bp of the B. 
subtilis 168 chromosome. 

The prpC coding sequence of the prpC protein (protein phosphatase) of B. 
subtilis 168 is shown below: 



TTGTTAACAGCCTTAAAAACAGATACAGGAAAAATCCGCCAGCATAATGAAGATGATG 

CGGGGATATTCAAGGGGAAAGATGAATTTATATTAGCGGTTGTCGCTGATGGCATGG 

GCGGCCATCTTGCTGGAGATGTTGCGAGCAAGATGGCTGTGAAAGCCATGGGGGAG 

AAATGGAATGAAGCAGAGACGATTCCAACTGCGCCCTCGGAATGTGAAAAATGGCTC 

ATTGAACAGATTCTATCGGTAAACAGCAAAATATACGATCACGCTCAAGCCCACGAAG 

AATGCCAAGGCATGGGGACGACGATTGTATGTGCACTTTTTACGGGGAAAACGGTTT 

CTGTTGCCCATATCGGAGACAGCAGATGCTATTTGCTTCAGGACGATGATTTCGTTCA 

AGTGACAGAAGACCATTCGCTTGTAAATGAACTGGTTCGCACTGGAGAGATTTCCAG 

AGAAGACGCTGAACATCATCCGCGAAAAAATGTGTTGACGAAGGCGCTTGGAACAGA 

CCAGTTAGTCAGTATTGACACCCGTTCCTTTGATATAGAACCCGGAGACAAACTGCTT 

CTATGTTCTGACGGACTGACAAATAAAGTGGAAGGCACTGAGTTAAAAGACATCCTG 

CAAAGCGATTCAGCTCCTCAGGAAAAAGTAAACCTGCTTGTGGACAAAGCCAATCAG 

AATGGCGGAGAAGACAACATTACAGCAGTTTTGCTTGAGCTTGCTTTACAAGTTGAAG 

AGGGTGAAGATCAGTGC (SEQ ID NO:29). 


The deduced amino acid sequence of the prpC protein is: 

MLTALKTDTGKIRQHNEDDAGIFKGKDEFILAVVADGMGGHLAGDVASKMAVKAMGEKW 
NEAETIPTAPSECEKWLIEQILSVNSKIYDHAQAHEECQGMGTTIVCALFTGKTVSVAHIG 
DSRCYLLQDDDFVQVTEDHSLVNELVRTGEISREDAEHHPRKNVLTKALGTDQLVSIDTR 
SFDIEPGDKLLLCSDGLTNKVEGTELKDILQSDSAPQEKVNLLVDKANQNGGEDNITAVLL 
ELALQVEEGEDQC (SEQ ID NO:30). 
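
The listed prpC coding sequence begins with TTG rather than ATG, while the deduced protein begins with M. In bacteria, alternative initiation codons such as TTG and GTG are read as methionine at the start of translation. The following sketch shows how a deduced amino acid sequence can be derived from a coding sequence under the standard genetic code. The function name `translate_cds` and its handling of start codons are illustrative assumptions, not the method used to prepare the sequences in this document.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
_BASES = "TCAG"
_AMINO = ("FFLLSSSSYY**CC*W" "LLLLPPPPHHQQRRRR"
          "IIIMTTTTNNKKSSRR" "VVVVAAAADDEEGGGG")
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(_BASES, repeat=3), _AMINO)}

# Initiation codons assumed here to be read as Met at position 1.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_cds(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = dna.upper()
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        codon = dna[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")
            continue
        aa = CODON_TABLE.get(codon, "X")  # "X" marks unreadable codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 33 bases of the prpC coding sequence listed above.
print(translate_cds("TTGTTAACAGCCTTAAAAACAGATACAGGAAAA"))
```

For these 33 bases the sketch yields MLTALKTDTGK, which agrees with the first eleven residues of the deduced prpC protein sequence.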

Additionally, the coding region is found at about 1649684-1650445 bp of the B. 
subtilis 168 chromosome. 

The rocA coding sequence of the rocA protein (pyrroline-5 carboxylate 
dehydrogenase) of B. subtilis 168 is shown below: 

ATGACAGTCACATACGCGCACGAACCATTTACCGATTTTACGGAAGCAAAGAATAAAA 

CTGCATTTGGGGAGTCATTGGCCTTTGTAAACACTCAGCTCGGCAAGCATTATCCGC 

TTGTCATAAATGGAGAAAAAATTGAAACGGACCGCAAAATCATTTCTATTAACCCGGC 

AAATAAAGAAGAGATCATTGGGTACGCGTCTACAGCGGATCAAGAGCTTGCTGAAAA 

AGCGATGCAAGCCGCATTGCAGGCATTTGATTCCTGGAAAAAACAAAGACCGGAGCA 

CCGCGCAAATATTCTCTTTAAGGCAGCGGCTATTTTGCGCAGAAGAAAGCATGAATTT 

TCAAGCTATCTTGTGAAGGAAGCAGGAAAACCGTGGAAGGAAGCAGATGCGGACAC 

GGCTGAAGCGATAGACTTTTTAGAGTTCTACGCGCGCCAAATGTTAAAGCTCAAGGA 

AGGGGCTCCGGTGAAGAGCCGTGCTGGCGAGGTCAATCAATATCATTACGAAGCGC 

TTGGCGTCGGCATCGTCATTTCTCCATTTAACTTCCCGCTCGCGATTATGGCGGGAA 

CAGCGGTGGCAGCGATTGTGACAGGAAATACGATTCTCTTAAAACCGGCTGACGCAG 
CCCCGGTAGTGGCAGCAAAATTTGTCGAGGTCATGGAGGAAGCGGGTCTGCCAAAC 

GGCGTTCTGAATTACATTCCGGGAGATGGTGCGGAGATCGGTGATTTCTTAGTTGAG 

CATCCGAAGACACGGTTTGTCTCATTTACAGGTTCCCGTGCAGTCGGCTGCCGGATT 

TATGAGCGAGCTGCCAAAGTGCAGCCGGGCCAAAAATGGCTCAAACGGGTAATTGC 

AGAAATGGGCGGAAAAGACACAGTGCTTGTCGACAAGGACGCTGATCTTGACCTTGC 

TGCATCCTCTATCGTGTATTCAGCATTTGGATATTCAGGACAGAAGTGTTCTGCGGGC 

TCCCGCGCGGTCATTCATCAGGATGTGTATGATGAAGTGGTGGAAAAAGCTGTGGCG 

CTGACCAAAACGCTGACTGTCGGCAATCCAGAAGATCCTGATACGTATATGGGTCCC 

GTGATTCATGAAGCATCCTACAACAAAGTGATGAAATACATTGAAATCGGCAAATCTG 

AAGGCAAGCTATTGGCCGGCGGAGAAGGCGATGATTCAAAAGGCTACTTTATTCAGC 

CGACGATCTTTGCAGATGTTGATGAAAACGCCCGCTTGATGCAGGAAGAAATTTTCG 

GCCCGGTTGTTGCGATTTGCAAAGCGCGTGATTTCGATCATATGCTGGAGATTGCCA 

ATAACACGGAATACGGATTAACAGGTGCGCTTCTGACGAAAAACCGTGCGCACATTG 

AACGGGCGCGCGAGGATTTCCATGTCGGAAACCTATATTTTAACAGAGGATGTACCG 

GAGCAATTGTCGGCTATCAGCCGTTCGGCGGTTTTAATATGTCAGGAACAGACTCAA 

AAGCAGGCGGTCCCGATTACTTAATTCTTCATATGCAAGCCAAAACAACGTCCGAAG 

CTTTT (SEQ ID NO:31). 

The deduced amino acid sequence of the RocA protein is: 

MTVTYAHEPFTDFTEAKNKTAFGESLAFVNTQLGKHYPLVINGEKIETDRKIISINPANK 

EEIIGYASTADQELAEKAMQAALQAFDSWKKQRPEHRANILFKAAAILRRRKHEFSSYLV 

KEAGKPWKEADADTAEAIDFLEFYARQMLKLKEGAPVKSRAGEVNQYHYEALGVGIVISP 

FNFPLAIMAGTAVAAIVTGNTILLKPADAAPVVAAKFVEVMEEAGLPNGVLNYIPGDGAEIG 

DFLVEHPKTRFVSFTGSRAVGCRIYERAAKVQPGQKWLKRVIAEMGGKDTVLVDKDADL 

DLAASSIVYSAFGYSGQKCSAGSRAVIHQDVYDEVVEKAVALTKTLTVGNPEDPDTYMG 

PVIHEASYNKVMKYIEIGKSEGKLLAGGEGDDSKGYFIQPTIFADVDENARLMQEEIFGPV 

VAICKARDFDHMLEIANNTEYGLTGALLTKNRAHIERAREDFHVGNLYFNRGCTGAIVGY 

QPFGGFNMSGTDSKAGGPDYLILHMQAKTTSEAF (SEQ ID NO:32). 

Additionally, the coding region is found at about 3877991-3879535 bp of the B. 
subtilis 168 chromosome. 



The rocD coding sequence of the rocD protein (ornithine aminotransferase) of B. 
subtilis 168 is shown below: 



ATGACAGCTTTATCTAAATCCAAAGAAATTATTGATCAGACGTCTCATTACGGAGCCA 

ACAATTATCACCCGCTCCCGATTGTTATTTCTGAAGCGCTGGGTGCTTGGGTAAAGG 

ACCCGGAAGGCAATGAATATATGGATATGCTGAGTGCTTACTCTGCGGTAAACCAGG 

GGCACAGACACCCGAAAATCATTCAGGCATTAAAGGATCAGGCTGATAAAATCACCC 

TCACGTCACGCGCGTTTCATAACGATCAGCTTGGGCCGTTTTACGAAAAAACAGCTAA 

ACTGACAGGCAAAGAGATGATTCTGCCGATGAATACAGGAGCCGAAGCGGTTGAATC 

CGCGGTGAAAGCGGCGAGACGCTGGGCGTATGAAGTGAAGGGCGTAGCTGACAAT 

CAAGCGGAAATTATCGCATGTGTCGGGAACTTCCACGGCCGCACGATGCTGGCGGT 

ATCTCTTTCTTCTGAAGAGGAATATAAACGAGGATTCGGCCCGATGCTTCCAGGAATC 

AAAGTCATTCCTTACGGCGATGTGGAAGCGCTTCGACAGGCCATTACGCCGAATACA 

GCGGCATTCTTGTTTGAACCGATTCAAGGCGAAGCGGGCATTGTGATTCCGCCTGAA 

GGATTTTTACAGGAAGCGGCGGCGATTTGTAAGGAAGAGAATGTCTTGTTTATTGCG 

GATGAAATTCAGACGGGTCTCGGACGTACAGGCAAGACGTTTGCCTGTGACTGGGA 

CGGCATTGTTCCGGATATGTATATCTTGGGCAAAGCGCTTGGCGGCGGTGTGTTCCC 

GATCTCTTGCATTGCGGCGGACCGCGAGATCCTAGGCGTGTTTAACCCTGGCTCACA 
CGGCTCAACATTTGGTGGAAACCCGCTTGCATGTGCAGTGTCTATCGCTTCATTAGAA 

GTGCTGGAGGATGAAAAGCTGGCGGATCGTTCTCTTGAACTTGGTGAATACTTTAAA 

AGCGAGCTTGAGAGTATTGACAGCCCTGTCATTAAAGAAGTCCGCGGCAGAGGGCT 

GTTTATCGGTGTGGAATTGACTGAAGCGGCACGTCCGTATTGTGAGCGTTTGAAGGA 

AGAGGGACTTTTATGCAAGGAAACGCATGATACAGTCATTCGTTTTGCACCGCCATTA 

ATCATTTCCAAAGAGGACTTGGATTGGGCGATAGAGAAAATTAAGCACGTGCTGCGA 

AACGCA (SEQ ID NO:33). 

The deduced amino acid sequence of the RocD protein is: 

MTALSKSKEIIDQTSHYGANNYHPLPIVISEALGAWVKDPEGNEYMDMLSAYSAVNQGHR 

HPKIIQALKDQADKITLTSRAFHNDQLGPFYEKTAKLTGKEMILPMNTGAEAVESAVKAAR 

RWAYEVKGVADNQAEIIACVGNFHGRTMLAVSLSSEEEYKRGFGPMLPGIKLIPYGDVEA 

LRQAITPNTAAFLFEPIQGEAGIVIPPEGFLQEAAAICKEENVLFIADEIQTGLGRTGK 

TFACDWDGIVPDMYILGKALGGGVFPISCIAADREILGVFNPGSHGSTFGGNPLACAVSI 

ASLEVLEDEKLADRSLELGEYFKSELESIDSPVIKEVRGRGLFIGVELTEAARPYCERLK 

EEGLLCKETHDTVIRFAPPLIISKEDLDWAIEKIKHVLRNA (SEQ ID NO:34). 



Additionally, the coding region is found at about 4143328-4144530 bp of the B. 
subtilis 168 chromosome. 



The rocF coding sequence of the rocF protein (arginase) of B. subtilis 168 is 
shown below: 

ATGGATAAAACGATTTCGGTTATTGGAATGCCAATGGATTTAGGAGAAGCACGACGC 

GGAGTGGATATGGGCCCGAGTGCCATCCGGTACGCTCATCTGATCGAGAGGCTGTC 

AGACATGGGGTATACGGTTGAAGATCTCGGTGACATTCCGATCAATCGCGAAAAAAT 

CAAAAATGACGAGGAACTGAAAAACCTGAATTCCGTTTTGGCGGGAAATGAAAAACT 

CGCGCAAAAGGTCAACAAAGTCATTGAAGAGAAAAAATTCCCGCTTGTCCTGGGCGG 

TGACCACAGTATTGCGATCGGCACGCTTGCAGGCACAGCGAAGCATTACGATAATCT 

CGGCGTCATCTGGTATGACGCGCACGGCGATTTGAATACACTTGAAACTTCACCATC 

GGGCAATATTCACGGCATGCCGCTCGCGGTCAGCCTAGGCATTGGCCACGAGTCAC 

TGGTTAACCTTGAAGGCTACGCGCCTAAAATCAAACCGGAAAACGTCGTCATCATTG 

GCGCCCGGTCACTTGATGAAGGGGAGCGCAAGTACATTAAGGAAAGCGGCATGAAG 

GTGTACACAATGCACGAAATCGATCGTCTTGGCATGACAAAGGTCATTGAAGAAACC 

CTTGATTATTTATCAGCATGTGATGGCGTCCATCTGAGCCTTGATCTGGACGGACTTG 

ATCCGAACGACGCACCGGGTGTCGGAACCCCTGTCGTCGGCGGCATCAGCTACCGG 

GAGAGCCATTTGGCTATGGAAATGCTGTATGACGCAGGCATCATTACCTCAGCCGAA 

TTCGTTGAGGTTAACCCGATCCTTGATCACAAAAACAAAACGGGCAAAACAGCAGTA 

GAGCTCGTAGAATCCCTGTTAGGGAAGAAGCTGCTG (SEQ ID NO:35). 



The deduced amino acid sequence of the RocF protein is: 

MDKTISVIGMPMDLGQARRGVDMGPSAIRYAHLIERLSDMGYTVEDLGDIPINREKIKND 

EELKNLNSVLAGNEKLAQKVNKVIEEKKFPLVLGGDHSIAIGTLAGTAKHYDNLGVIWYD 

AHGDLNTLETSPSGNIHGMPLAVSLGIGHESLVNLEGYAPKIKPENVVIIGARSLDEGER 

KYIKESGMKVYTMHEIDRLGMTKVIEETLDYLSACDGVHLSLDLDGLDPNDAPGVGTPVV 

GGISYRESHLAMEMLYDAGIITSAEFVEVNPILDHKNKTGKTAVELVESLLGKKLL (SEQ ID NO:36). 
Additionally, the coding region is found at about 4140738-4141625 bp of the B. 
subtilis 168 chromosome. 



The Tdh coding sequence of the Tdh protein (threonine 3-dehydrogenase) of B. 
subtilis 168 is shown below: 

ATGCAGAGTGGAAAGATGAAAGCTCTAATGAAAAAGGACGGGGCGTTCGGTGCTGT 

GCTGACTGAAGTTCCCATTCCTGAGATTGATAAACATGAAGTCCTCATAAAAGTGAAA 

GCCGCTTCCATATGCGGCACGGATGTCCACATTTATAATTGGGATCAATGGGCACGT 

CAGAGAATCAAAACACCCTATGTTTTCGGCCATGAGTTCAGCGGCATCGTAGAGGGC 

GTGGGAGAGAATGTCAGCAGTGTAAAAGTGGGAGAGTATGTGTCTGCGGAAACACA 

CATTGTCTGTGGTGAATGTGTCCCTTGCCTAACAGGAAAATCTCATGTGTGTACCAAT 

ACTGCTATAATCGGAGTGGACACGGCAGGCTGTTTTGCGGAGTATGTAAAAGTTCCA 

GCTGATAACATTTGGAGAAATCCCGCTGATATGGACCCGTCGATTGCTTCCATTCAAG 

AGCCTTTAGGAAATGCAGTTCATACCGTACTCGAGAGCCAGCCTGCAGGAGGAACGA 

CTGCAGTCATTGGATGCGGACCGATTGGTCTTATGGCTGTTGCGGTTGCAAAAGCAG 

CAGGAGCTTCTCAGGTGATAGCGATTGATAAGAATGAATACAGGCTGAGGCTTGCAA 

AACAAATGGGAGCGACTTGTACTGTTTCTATTGAAAAAGAAGACCCGCTCAAAATTGT 

AAGCGCTTTAACGAGTGGAGAAGGAGCAGATCTTGTTTGTGAGATGTCGGGCCATCC 

CTCAGCGATTGCCCAAGGTCTTGCGATGGCTGCGAATGGCGGAAGATTTCATATTCT 

CAGCTTGCCGGAACATCCGGTGACAATTGATTTGACGAATAAAGTGGTATTTAAAGG 

GCTTACCATCCAAGGAATCACAGGAAGAAAAATGTTTTCAACATGGCGCCAGGTGTC 

TCAGTTGATCAGTTCAAACATGATCGATCTTGCACCTGTTATTACCCATCAGTTTCCAT 

TAGAGGAGTTTGAAAAAGGTTTCGAACTGATGAGAAGCGGGCAGTGCGGAAAAGTAA 
TTTTAATTCCA (SEQ ID NO:37). 



The deduced amino acid sequence of the Tdh protein is: 

MQSGKMKALMKKDGAFGAVLTEVPIPEIDKHEVLIKVKAASICGTDVHIYNWDQWARQRI 

KTPYVFGHEFSGIVEGVGENVSSVKVGEYVSAETHIVCGECVPCLTGKSHVCTNTAIIGV 

DTAGCFAEYVKVPADNIWRNPADMDPSIASIQEPLGNAVHTVLESQPAGGTTAVIGCGPI 

GLMAVAVAKAAGASQVIAIDKNEYRLRLAKQMGATCTVSIEKEDPLKIVSALTSGEGADLV 

CEMSGHPSAIAQGLAMAANGGRFHILSLPEHPVTIDLTNKVVFKGLTIQGITGRKMFSTW 

RQVSQLISSNMIDLAPVITHQFPLEEFEKGFELMRSGQCGKVILIP (SEQ ID NO:38). 

Additionally, the coding region is found at about 1769731 - 1770771 bp of the B. 
subtilis 168 chromosome. 



The coding sequences for the tryptophan operon regulatory region and genes 
trpE (SEQ ID NO:48), trpD (SEQ ID NO:46), trpC (SEQ ID NO:44), trpF (SEQ ID NO:50), 
trpB (SEQ ID NO:42), and trpA (SEQ ID NO:40) are shown below. The operon regulatory 
region is underlined. The trpE start (ATG) is shown in bold, followed as well by the trpD, 
trpC, trpF, trpB, and trpA starts (also indicated in bold, in the order shown). 

TAATACGATAAGAACAG CTTAGAAATACACAAGAGTGTGTATAAAGCAATTAnAAf fiA 
GTTGAGTTAGAGAATAG GGTAGCAGAGAATGAGTTTAGTTGAGCTGAGACATTATRTT 
TATTCTACCCAAAAGAAGT CTTTCTTTTGGGTTTATTTGTTATATAGTATTTTATCCTCT 
CATGCCATCTTCTCATTr.T CCTTGCCATAAGGAGTGAQAGCAA TGAATTTr.rAATrAA 
ACATTTCCGCATTTTTAGAGGACAGCTTGTCCCACCACACGATACCGATTGTGGAGAC 

CTTCACAGTCGATACACTGACACCCATTCAAATGATAGAGAAGCTTGACAGGGAGATT 

ACGTATCTTCTTGAAAGCAAGGACGATACATCCACTTGGTCCAGATATTCGTTTATCG 

GCCTGAATCCATTTCTCACAATTAAAGAAGAGCAGGGCCGTTTTTCGGCCGCTGATC 

AGGACAGCAAATCTCTTTACACAGGAAATGAACTAAAAGAAGTGCTGAACTGGATGAA 

TACCACATACAAAATCAAAACACCTGAGCTTGGCATTCCTTTTGTCGGCGGAGCTGTC 

GGGTACTTAAGCTATGATATGATCCCGCTGATTGAGCCTTCTGTTCCTTCGCATACCA 

AAGAAACAGACATGGAAAAGTGTATGCTGTTTGTTTGCCGGACATTAATTGCGTATGA 

TCATGAAACCAAAAACGTCCACTTTATCCAATATGCAAGGCTCACTGGAGAGGAAACA 

AAAAACGAAAAAATGGATGTATTCCATCAAAATCATCTGGAGCTTCAAAATGTCATTGA 

AAAAATGATGGACCAAAAAAACATAAAAGAGCTGTTTCTTTCTGCTGATTCATACAAGA 

CACCCAGCTTTGAGACAGTATCTTCTAATTATGAAAAATCGGCTTTTATGGCTGATGTA 

GAAAAAATCAAAAGCTATATAAAAGCAGGCGATATCTTGCAGGGTGTTTTATCACAAA 

AATTTGAGGTGCCGATAAAAGCAGATGCTTTTGAGTTATACCGAGTGCTTAGGATCGT 

CAATCCTTCGCCGTATATGTATTATATGAAACTGCTAGACAGAGAAATAGTCGGCAGC 

TCTCCGGAACGGTTAATACACGTTCAAGACGGGCACTTAGAAATCCATCCGATTGCC 

GGTACGAGAAAACGCGGTGCAGACAAAGCTGAAGATGAGAGACTGAAGGTTGAGCT 

CATGAAGGATGAAAAAGAAAAAGCGGAGCATTACATGCTCGTTGATCTTGCCCGAAA 

CGATATCGGCAGAGTAGCAGAGTATGGTTCTGTTTCTGTGCCGGAGTTCACAAAAAT 

TGTTTCCTTTTCACATGTCATGCACATTATCTCGGTGGTTACAGGCCGATTGAAAAAA 

GGGGTTCATCCTGTCGATGCACTGATGTCTGCTTTCCCGGCGGGGACTTTAACAGGC 

GCACCCAAAATCCGTGCCATGCAGCTTTTGCAAGAACTCGAGCCAACACCGAGAGAG 

ACATACGGAGGGTGTATTGCCTACATTGGGTTTGACGGGAATATCGACTCTTGTATTA 

CGATTCGCACGATGAGTGTAAAGAACGGTGTTGCATCGATACAGGCAGGTGCTGGC 

ATTGTTGCTGATTCTGTTCCGGAAGCCGAATACGAAGAAAGCTGTAATAAAGGCGGT 

GCGCTGCTGAAAACGATTGATATTGCAGAAGACATGTTTCATAGCAAGGAGGATAAA 

GCTGATGAACAGATTTGTACAATTGTGGGTTGACGGAAAAACCCTTACTGCCGGTGA 

GGCTGAAACGCTGATGAATATGATGATGGCAGCGGAAATGACTCCTTCTGAAATGGG 

GGGGATATTGTCAATTCTTGCTCATCGGGGGGAGACGCCAGAAGAGCTTGCGGGTT 

TTGTGAAGGCAATGCGGGCACACGCTCTTACAGTCGATGGACTTCCTGATATTGTTG 

ATACATGCGGAACAGGGGGAGACGGTATTTCCACTTTTAATATCTCAACGGCCTCGG 

CAATTGTTGCCTCGGCAGCTGGTGCGAAAATCGCTAAGCATGGCAATCGCTCTGTCT 

CTTCTAAAAGCGGAAGCGCTGATGTTTTAGAGGAGCTAGAGGTTTCTATTCAAACCAC 

TCCCGAAAAGGTCAAAAGCAGCATTGAAACAAACAACATGGGATTTCTTTTTGCGCCG 

CTTTACCATTCGTCTATGAAACATGTAGCAGGTACTAGAAAAGAGCTAGGTTTCAGAA 

CGGTATTTAATCTGCTTGGGCCGCTCAGCAATCCTTTACAGGCGAAGCGTCAGGTGA 

TTGGGGTCTATTCTGTTGAAAAAGCTGGACTGATGGCAAGCGCACTGGAGACGTTTC 

AGCCGAAGCACGTTATGTTTGTATGAAGCCGTGACGGTTTAGATGAGCTTTCAATTAC 

AGCACCGACCGACGTGATTGAATTAAAGGACGGAGAGCGCCGGGAGTATACCGTTT 

CACCCGAAGATTTCGGTTTCACAAATGGCAGACTTGAAGATTTACAGGTGCAGTCTCC 

GAAAGAGAGCGCTTATCTCATTCAGAATATTTTTGAAAATAAAAGCAGCAGTTCCGCT 

TTATCTATTACGGCTTTTAATGCGGGTGCTGCGATTTACACGGCGGGAATTACCGCCT 

CACTGAAGGAAGGAACGGAGCTGGCGTTAGAGACGATTACAAGCGGAGGCGCTGCC 

GCGCAGCTTGAACGACTAAAGCAGAAAGAGGAAGAGATCTATGCTTGAAAAAATCAT 

CAAACAAAAGAAAGAAGAAGTGAAAACACTGGTTCTGCCGGTAGAGCAGCCTTTCGA 

GAAACGTTCATTTAAGGAGGCGCCGGCAAGCCCGAATCGGTTTATCGGGTTGATTGC 

CGAAGTGAAGAAAGCATCGCCGTCAAAAGGGCTTATTAAAGAGGATTTTGTACCTGT 

GCAGATTGCAAAAGACTATGAGGCTGCGAAGGCAGATGCGATTTCCGTTTTAACAGA 

CACCCCGTTTTTTCAAGGGGAAAACAGCTATTTATCAGACGTAAAGCGTGCTGTTTCG 

ATTCCTGTACTTAGAAAAGATTTTATTATTGATTCTCTTCAAGTAGAGGAATCAAGAAG 

AATCGGAGCGGATGCCATATTGTTAATCGGCGAGGTGCTTGATCCCTTACACCTTCAT 

GAATTATATCTTGAAGCAGGTGAAAAGGGGATGGACGTGTTAGTGGAGGTTCATGAT 

GCATCAACGCTAGAACAAATATTGAAAGTGTTCACACCCGACATTCTCGGCGTAAATA 
ATCGAAACCTAAAAACGTTTGAAACATCTGTAAAGCAGACAGAACAAATCGCATCTCT 

CGTTCCGAAAGAATCCTTGCTTGTCAGCGAAAGCGGAATCGGTTCTTTAGAACATTTA 

ACATTTGTCAATGAACATGGGGCGCGAGCTGTACTTATCGGTGAATCATTGATGAGA 

CAAACTTCTCAGCGTAAAGCAATCCATGCTTTGTTTAGGGAGTGAGGTTGTGAAGAAA 

CCGGCATTAAAATATTGCGGTATTCGGTCACTAAAGGATTTGCAGCTTGCGGCGGAA 

TCACAGGCTGATTACCTAGGATTTATTTTTGCTGAAAGCAAACGAAAAGTATCTCCGG 

AAGATGTGAAAAAATGGCTGAACCAAGTTCGTGTCGAAAAACAGGTTGCAGGTGTTTT 

TGTTAATGAATCAATAGAGACGATGTCACGTATTGCCAAGAGCTTGAAGCTCGACGTC 

ATTCAGCTTCACGGTGATGAAAAACCGGCGGATGTCGCTGCTCTTCGCAAGCTGACA 

GGCTGTGAAATATGGAAGGCGCTTCACCATCAAGATAACACAACTCAAGAAATAGGC 

CGCTTTAAAGATAATGTTGACGGCTTTGTGATTGATTCATCTGTAAAAGGGTCTAGAG 

GCGGAACTGGTGTTGCATTTTCTTGGGACTGTGTGCCGGAATATCAGCAGGCGGCTA 

TTGGTAAACGCTGCTTTATCGCTGGCGGCGTGAATCCGGATAGCATCACACGCCTAT 

TGAAATGGCAGCCAGAAGGAATTGACCTTGCCAGCGGAATTGAAAAAAACGGACAAA 

AAGATCAGAATCTGATGAGGGTTTTAGAAGAAAGGATGAACCGATATGTATCCATATC 

CGAATGAAATAGGCAGATACGGTGATTTTGGCGGAAAGTTTGTTCCGGAAACACTCA 

TGCAGCCGTTAGATGAAATACAAACAGCATTTAAACAAATCAAGGATGATCCCGCTTT 

TCGTGAAGAGTATTATAAGCTGTTAAAGGACTATTCCGGACGCCCGACTGCATTAACA 

TACGCTGATCGAGTCACTGAATACTTAGGCGGCGCGAAAATCTATTTGAAACGAGAA 

GATTTAAACCATACAGGTTCTCATAAAATCAATAATGCGCTAGGTCAAGCGCTGCTTG 

CTAAAAAAATGGGCAAAACGAAAATCATTGCTGAAACCGGTGCCGGCCAGCATGGTG 

TTGCCGCTGCAACAGTTGCAGCCAAATTCGGCTTTTCCTGTACTGTGTTTATGGGTGA 

AGAGGATGTTGCCCGCCAGTCTCTGAACGTTTTCCGCATGAAGCTTCTTGGAGCGGA 

GGTAGTGCCTGTAACAAGCGGAAACGGAACATTGAAGGATGCCACAAATGAGGCGA 

TCCGGTACTGGGT 

TCAGCATTGTGAGGATCACTTTTATATGATTGGATCAGTTGTCGGCCCGCATCCTTAT 

CCGCAAGTGGTCCGTGAATTTCAAAAAATGATCGGAGAGGAAGCGAAGGATCAGTTG 

AAACGTATTGAAGGCACTATGCCTGATAAAGTAGTGGCATGTGTAGGCGGAGGAAGC 

AATGCGATGGGTATGTTTCAGGCATTTTTAAATGAAGATGTTGAACTGATCGGCGCTG 

AAGCAGCAGGAAAAGGAATTGATACACCTCTTCATGCCGCCACTATTTGGAAAGGAA 

CCGTAGGGGTTATTCACGGTTCATTGACTTATCTCATTCAGGATGAGTTCGGGCAAAT 

TATTGAGCCCTACTCTATTTCAGCCGGTCTCGACTATCCTGGAATCGGTCCGGAGCA 

TGCATATTTGCATAAAAGGGGCCGTGTCACTTATGACAGTATAACCGATGAAGAAGC 

GGTGGATGCATTAAAGCTTTTGTCAGAAAAAGAGGGGATTTTGCCGGCAATCGAATC 

TGCCCATGCGTTAGCGAAAGCATTCAAACTCGCCAAAGGAATGGATCGCGGTCAACT 

CATTCTCGTCTGTTTATCAGGCCGGGGAGACAAGGATGTCAACACATTAATGAATGTA 

TTGGAAGAAGAGGTGAAAGCCCATGTTTAAATTGGATCTTCAACCATCAGAAAAATTG 

TTTATCCCGTTTATTACGGCGGGCGATCCAGTTCCTGAGGTTTCGATTGAACTGGCG 

AAGTCACTCCAAAAAGCAGGCGCCACAGCATTGGAGCTTGGTGTTGCATACTCTGAC 

CCGCTTGCAGACGGTCCGGTGATCCAGCGGGCTTCAAAGCGGGCGCTTGATCAAGG 

AATGAATATCGTAAAGGCAATCGAATTAGGCGGAGAAATGAAAAAAAACGGAGTGAA 

TATTCCGATTATCCTCTTTACGTATTATAATCCTGTGTTACAATTGAACAAAGAATACTT 

TTTCGCTTTACTGCGGGAAAATCATATTGACGGTCTGCTTGTTCCGGATCTGCCATTA 

GAAGAAAGCAACAGCCTTCAAGAGGAATGTAAAAGCCATGAGGTGACGTATATTTCTT 

TAGTTGCGCCGACAAGCGAAAGCCGTTTGAAAACCATTATTGAACAAGCCGAGGGGT 

TCGTCTACTGTGTATCTTCTCTGGGTGTGACCGGTGTCCGCAATGAGTTCAATTCATC 

CGTGTACCCGTTCATTCGTACTGTGAAGAATCTCAGCACTGTTCCGGTTGCTGTAGG 

GTTCGGTATATCAAACCGTGAACAGGTCATAAAGATGAATGAAATTAGTGACGGTGTC 

GTAGTGGGAAGTGCGCTCGTCAGAAAAATAGAAGAATTAAAGGACCGGCTCATCAGC 

GCTGAAACGAGAAATCAGGCGCTGCAGGAGTTTGAGGATTATGCAATGGGGTTTAGC 

GGCTTGTACAGTTTAAAA (SEQ ID NO:39). 
The deduced TrpA protein (tryptophan synthase (alpha subunit)) sequence is: 

MFKLDLQPSEKLFIPFITAGDPVPEVSIELAKSLQKAGATALELGVAYSDPLADGPVIQR 
ASKRALDQGMNIVKAIELGGEMKKNGVNIPIILFTYYNPVLQLNKEYFFALLRENHIDGL 
LVPDLPLEESNSLQEECKSHEVTYISLVAPTSESRLKTIIEQAEGFVYCVSSLGVTGVRN 
EFNSSVYPFIRTVKNLSTVPVAVGFGISNREQVIKMNEISDGVVVGSALVRKIEELKDRL 
ISAETRNQALQEFEDYAMAFSGLYSLK (SEQ ID NO:41). 



The deduced TrpB protein (tryptophan synthase (beta subunit)) sequence is: 

MYPYPNEIGRYGDFGGKFVPETLMQPLDEIQTAFKQIKDDPAFREEYYKLLKDYSGRPTA 

LTYADRVTEYLGGAKIYLKREDLNHTGSHKINNALGQALLAKKMGKTKIIAETGAGQHGVA 

AATVAAKFGFSCTVFMGEEDVARQSLNVFRMKLLGAEVVPVTSGNGTLKDATNEAIRYW 

VQHCEDHFYMIGSVVGPHPYPQVVREFQKMIGEEAKDQLKRIEGTMPDKVVACVGGGS 

NAMGMFQAFLNEDVELIGAEAAGKGIDTPLHAATISKGTVGVIHGSLTYLIQDEFGQIIEPY 

SISAGLDYPGIGPEHAYLHKSGRVTYDSITDEEAVDALKLLSEKEGILPAIESAHALAKAFKL 

AKGMDRGQLILVCLSGRGDKDVNTLMNVLEEEVKAHV(SEQ ID NO:43). 

The deduced TrpC protein (indole-3-glycerol phosphate synthase) sequence is: 

MLEKIIKQKKEEVKTLVLPVEQPFEKRSFKEAPASPNRFIGLIAEVKKASPSKGLIKEDF 
VPVQIAKDYEAAKADAISVLTDTPFFQGENSYLSDVKRAVSIPVLRKDFIIDSLQVEESR 
RIGADAILLIGEVLDPLHLHELYLEAGEKGMDVLVEVHDASTLEQILKVFTPDILGVNNR 
NLKTFETSVKQTEQIASLVPKESLLVSESGIGSLEHLTFVNEHGARAVLIGESLMRQTSQ 
RKAIHALFRE (SEQ ID NO:45). 



The deduced TrpD protein (anthranilate phosphoribosyltransferase) sequence is: 

MNRFLQLCVDGKTLTAGEAETLMNMMMAAEMTPSEMGGILSILAHRGETPEELAGFVKA 

MRAHALTVDGLPDIVDTCGTGGDGISTFNISTASAIVASAAGAKIAKHGNRSVSSKSGSAD 

VLEELEVSIQTTPEKVKSSIETNNMGFLFAPLYHSSMKHVAGTRKELGFRTVFNLLGPLSN 

PLQAKRQVIGVYSVEKAGLMASALETFQPKHVMFVSSRDGLDELSITAPTDVIELKDGER 

REYTVSPEDFGFTNGRLEDLQVQSPKESAYLIQNIFENKSSSSALSITAFNAGAAIYTAGIT 

ASLKEGTELALETITSGGAAAQLERLKQKEEEIYA(SEQ ID NO:47). 



The deduced TrpE protein (anthranilate synthase) sequence is: 

MNFQSNISAFLEDSLSHHTIPIVETFTVDTLTPIQMIEKLDREITYLLESKDDTSTWSRY 

SFIGLNPFLTIKEEQGRFSAADQDSKSLYTGNELKEVLNWMNTTYKIKTPELGIPFVGGA 

VGYLSYDMIPLIEPSVPSHTKETDMEKCMLFVCRTLIAYDHETKNVHFIQYARLTGEETK 

NEKMDVFHQNHLELQNLIEKMMDQKNIKELFLSADSYKTPSFETVSSNYEKSAFMADVEK 

IKSYIKAGDIFQGVLSQKFEVPIKADAFELYRVLRIVNPSPYMYYMKLLDREIVGSSPERLIH 

VQDGHLEIHPIAGTRKRGADKAEDERLKVELMKDEKEKAEHYMLVDLARNDIGRVAEYG 

SVSVPEFTKIVSFSHVMHIISVVTGRLKKGVHPVDALMSAFPAGTLTGAPKIRAMQLLQEL 

EPTPRETYGGCIAYIGFDGNIDSCITIRTMSVKNGVASIQAGAGIVADSVPEAEYEESCNKA 

GALLXTIHIAEDMFHSKEDKADEQISTIVR (SEQ ID NO:49). 

The deduced TrpF protein (phosphoribosyl anthranilate isomerase) sequence is: 
MKKPALKYCGIRSLKDLQLAAESQADYLGFIFAESKRKVSPEDVKKWLNQVRVEKQVAG 
VFVNESIETMSRIAKSLKLDVIQLHGDEKPADVAALRKLTGCEIWKALHHQDNTTQEIARF 
KDNVDGFVIDSSVKGSRGGTGVAFSWDCVPEYQQAAIGKRCFIAGGVNPDSITRLLKWQ 
PEGIDLASGIEKNGQKDQNLMRLLEERMNRYVSISE (SEQ ID NO:51). 

Additionally, the coding region is found at about 2370707 bp to 2376834 bp 
(first bp = 2376834; last bp = 2370707) of the B. subtilis 168 chromosome. 
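
Because the first base pair quoted for the tryptophan operon (2376834) is larger than the last (2370707), the operon appears to be read from the reverse strand relative to the chromosome numbering. The sketch below, which is illustrative and not part of the original disclosure, shows how a reverse-strand coding sequence could be recovered from a forward-strand genome string. The function names and the toy genome are hypothetical, and 1-based inclusive coordinates are assumed.

```python
_COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA string."""
    return seq.translate(_COMPLEMENT)[::-1]


def extract_region(genome: str, first_bp: int, last_bp: int) -> str:
    """Extract a region using 1-based inclusive coordinates.

    If first_bp > last_bp the region is taken from the reverse strand.
    """
    if first_bp <= last_bp:
        return genome[first_bp - 1:last_bp]
    return reverse_complement(genome[last_bp - 1:first_bp])


# Toy forward-strand sequence (hypothetical, for illustration only).
toy_genome = "CCGAAATTCATGG"
print(extract_region(toy_genome, 11, 3))   # reverse-strand read
print(abs(2370707 - 2376834) + 1)          # span of the trp region in bp
```

With the quoted coordinates the region spans 6128 bp; the toy example simply shows that swapping the first and last positions switches the read to the opposite strand.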



The ycgM coding sequence of the ycgM protein (similar to proline oxidase) of B. 
subtilis 168 is shown below: 

GTGATCACAAGAGATTTTTTCTTATTTTTATCCAAAAGCGGCTTTCTCAATAAAATGGC 

GAGGAACTGGGGAAGTCGGGTAGCAGCGGGTAAAATTATCGGCGGGAATGACTTTA 

ACAGTTCAATCCCGACCATTCGACAGCTTAACAGCCAAGGCTTGTCAGTTACTGTCGA 

TCATTTAGGCGAGTTTGTGAACAGCGCCGAGGTCGCACGGGAGCGTACGGAAGAGT 

GCATTCAAACCATTGCGACCATCGCGGATCAGGAGCTGAACTCACACGTTTCTTTAAA 

AATGACGTCTTTAGGTTTGGATATAGATATGGATTTGGTGTATGAAAATATGACAAAAA 

TCCTTCAGACGGCCGAGAAACATAAAATCATGGTCACCATTGACATGGAGGACGAAG 

TCAGATGCCAGAAAACGCTTGATATTTTCAAAGATTTCAGAAAGAAATACGAGCATGT 

GAGCACAGTGCTGCAAGCCTATCTGTACCGGACGGAAAAAGACATTGACGATTTGGA 

TTCTTTAAACCCGTTCCTTCGCCTTGTAAAAGGAGCTTATAAAGAATCAGAAAAAGTA 

GCTTTCCCGGAGAAAAGCGATGTCGATGAAAATTACAAAAAAATCATCCGAAAGCAG 

CTCTTAAACGGTCACTATACAGCGATTGCCACACATGACGACAAAATGATCGACTTTA 

CAAAGCAGCTTGCCAAGGAACATGGCATTGCCAATGACAAGTTTGAATTTCAGATGCT 

GTACGGCATGCGGTCGCAAACCCAGCTCAGCCTCGTAAAAGAAGGTTATAACATGAG 

AGTCTACCTGCCATACGGCGAGGATTGGTACGGCTACTTTATGAGACGCCTTGCAGA 

ACGTCCGTCAAACATTGCATTTGCTTTCAAAGGAATGACAAAGAAG (SEQ ID NO:52). 

The deduced amino acid sequence of the YcgM protein is: 

MITRDFFLFLSKSGFLNKMARNWGSRVAAGKIIGGNDFNSSIPTIRQLNSQGLSVTVDHL 
GEFVNSAEVARERTEECIQTIATIADQELNSHVSLKMTSLGLDIDMDLVYENMTKILQTA 
EKHKIMVTIDMEDEVRCQKTLDIFKDFRKKYEHVSTVLQAYLYRTEKDIDDLDSLNPFLR 
LVKGAYKESEKVAFPEKSDVDENYKKIIRKQLLNGHYTAIATHDDKMIDFTKQLAKEHGI 

ANDKFEFQMLYGMRSQTQLSLVKEGYNMRVYLPYGEDWYGYFMRRLAERPSNIAFAFK 
GMTKK (SEQ ID NO:53). 

Additionally, the coding region is found at about 344111-345019 bp of the B. 
subtilis 168 chromosome. 



The ycgN coding sequence of the ycgN protein (similar to 1-pyrroline-5-carboxylate 
dehydrogenase) of B. subtilis 168 is shown below: 



ATGACAACACCTTACAAACACGAGCCATTCACAAATTTCCAAGATCAAAACTACGTGG 

AAGCGTTTAAAAAAGCGCTTGCGACAGTAAGCGAATATTTAGGAAAAGACTATCCGCT 

TGTCATTAACGGCGAGAGAGTGGAAACGGAAGCGAAAATCGTTTCAATCAACCCAGC 

TGATAAAGAAGAAGTCGTCGGCCGAGTGTCAAAAGCGTCTCAAGAGCACGCTGAGC 

AAGCGATTCAAGCGGCTGCAAAAGCATTTGAAGAGTGGAGATACACGTCTCCTGAAG 

AGAGAGCGGCTGTCCTGTTCCGCGCTGCTGCCAAAGTCCGCAGAAGAAAACATGAA 
TTCTCAGCTTTGCTTGTGAAAGAAGCAGGAAAGCCTTGGAACGAGGCGGATGCCGAT 

ACGGCTGAAGCGATTGACTTCATGGAGTATTATGCACGCCAAATGATCGAACTGGCA 

AAAGGCAAACCGGTCAACAGCCGTGAAGGCGAGAAAAACCAATATGTATACACGCCG 

ACTGGAGTGACAGTCGTTATCCCGCCTTGGAACTTCTTGTTTGCGATCATGGCAGGC 

ACAACAGTGGCGCCGATCGTTACTGGAAACACAGTGGTTCTGAAACCTGCGAGTGCT 

ACACCTGTTATTGCAGCAAAATTTGTTGAGGTGCTTGAAGAGTCCGGATTGCCAAAAG 

GCGTAGTCAACTTTGTTCCGGGAAGCGGATCGGAAGTAGGCGACTATCTTGTTGACC 

ATCCGAAAACAAGCCTTATCACATTTACGGGATCAAGAGAAGTTGGTACGAGAATTTT 

CGAACGCGCGGCGAAGGTTCAGCCGGGCCAGCAGCATTTAAAGCGTGTCATCGCTG 

AAATGGGCGGTAAAGATACGGTTGTTGTTGATGAGGATGCGGACATTGAATTAGCGG 

CTCAATCGATCTTTACTTCAGCATTCGGCTTTGCGGGACAAAAATGCTCTGCAGGTTC 

ACGTGCAGTAGTTCATGAAAAAGTGTATGATCAAGTATTAGAGCGTGTCATTGAAATT 

ACGGAATCAAAAGTAACAGCTAAACCTGACAGTGCAGATGTTTATATGGGACCTGTCA 

TTGACCAAGGTTCTTATGATAAAATTATGAGCTATATTGAGATCGGAAAACAGGAAGG 

GCGTTTAGTAAGCGGCGGTACTGGTGATGATTCGAAAGGATACTTCATCAAACCGAC

GATCTTCGCTGACCTTGATCCGAAAGCAAGACTCATGCAGGAAGAAATTTTCGGACC 

TGTCGTTGCATTTTGTAAAGTGTCAGACTTTGATGAAGCTTTAGAAGTGGCAAACAAT 

ACTGAATATGGTTTGACAGGCGCGGTTATCACAAACAACCGCAAGCACATCGAGCGT 

GCGAAACAGGAATTCCATGTCGGAAACCTATACTTCAACCGCAACTGTACAGGTGCT

ATCGTCGGCTACCATCCGTTTGGCGGCTTCAAAATGTCGGGAACGGATTCAAAAGCA 

GGCGGGCCGGATTACTTGGCTCTGCATATGCAAGCAAAAACAATCAGTGAAATGTTC 

(SEQ ID NO:54). 

The deduced amino acid sequence of YcgN protein is: 

MTTPYKHEPFTNFQDQNYVEAFKKALATVSEYLGKDYPLVINGERVETEAKIVSINPADK

EEVVGRVSKASQEHAEQAIQAAAKAFEEWRYTSPEERAAVLFRAAAKVRRRKHEFSALL

VKEAGKPWNEADADTAEAIDFMEYYARQMIELAKGKPVNSREGEKNQYVYTPTGVTVVI

PPWNFLFAIMAGTTVAPIVTGNTVVLKPASATPVIAAKFVEVLEESGLPKGVVNFVPGSGS

EVGDYLVDHPKTSLITFTGSREVGTRIFERAAKVQPGQQHLKRVIAEMGGKDTVVVDEDA

DIELAAQSIFTSAFGFAGQKCSAGSRAVVHEKVYDQVLERVIEITESKVTAKPDSADVYMG

PVIDQGSYDKIMSYIEIGKQEGRLVSGGTGDDSKGYFIKPTIFADLDPKARLMQEEIFGPVV

AFCKVSDFDEALEVANNTEYGLTGAVITNNRKHIERAKQEFHVGNLYFNRNCTGAIVGYH

PFGGFKMSGTDSKAGGPDYLALHMQAKTISEMF (SEQ ID NO:55).

Additionally, the coding region is found at about 345039-346583 bp of the B.
subtilis 168 chromosome.
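
As an illustration of how a deduced amino acid sequence is derived from a coding sequence, the following is a minimal sketch that translates a DNA string codon by codon with the standard genetic code. The function name `translate` and the handling of unreadable codons (reported as "X") are assumptions made for this example and are not part of the disclosure.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon.

    Whitespace is ignored; codons containing unexpected characters
    (for example, extraction errors) are reported as 'X'.
    """
    seq = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the ycgN coding sequence shown above.
ycgn_start = "ATGACAACACCTTACAAACACGAGCCATTCACAAATTTCCAAGATCAAAACTACGTGG"
print(translate(ycgn_start))  # MTTPYKHEPFTNFQDQNYV
```

Translating the full coding sequence in this way gives a direct check of the deduced protein sequence against the nucleotide sequence.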

The sigD coding sequence of the SigD protein (RNA polymerase flagella, motility,
chemotaxis and autolysis sigma factor) of B. subtilis 168 is shown below:



ATGCAATCCTTGAATTATGAAGATCAGGTGCTTTGGACGCGCTGGAAAGAGTGGAAA 

GATCCTAAAGCCGGTGACGACTTAATGCGCCGTTACATGCCGCTTGTCACATATCAT 

GTAGGCAGAATTTCTGTCGGACTGCCGAAATCAGTGCATAAAGACGATCTTATGAGC 

CTTGGTATGCTTGGTTTATATGATGCCCTTGAAAAATTTGACCCCAGCCGGGACTTAA 

AATTTGATACCTACGCCTCGTTTAGAATTCGCGGCGCAATCATAGACGGGCTTCGTAA 

AGAAGATTGGCTGCCCAGAACCTCGCGCGAAAAAACAAAAAAGGTTGAAGCAGCAAT 

TGAAAAGCTTGAACAGCGGTATCTTCGGAATGTATCGCCCGCGGAAATTGCAGAGGA 

ACTCGGAATGACGGTACAGGATGTCGTGTCAACAATGAATGAAGGTTTTTTTGCAAAT 

CTGCTGTCAATTGATGAAAAGCTCCATGATCAAGATGACGGGGAAAACATTCAAGTCA 

TGATCAGAGATGACAAAAATGTTCCGCCTGAAGAAAAGATTATGAAGGATGAACTGAT 

TGCACAGCTTGCGGAAAAAATTCACGAACTCTCTGAAAAAGAACAGCTGGTTGTCAG

TTTGTTCTACAAAGAGGAGTTGACACTGACAGAAATCGGACAAGTATTAAATCTTTCT

AGGTCCCGCATATCTCAGATCCATTCAAAGGCATTATTTAAATTAAAGAATCTGCTGG

AAAAAGTGATAGAA (SEQ ID NO:56).

The deduced amino acid sequence of the SigD is: 

MQSLNYEDQVLWTRWKEWKDPKAGDDLMRRYMPLVTYHVGRISVGLPKSVHKDDLMS
LGMLGLYDALEKFDPSRDLKFDTYASFRIRGAIIDGLRKEDWLPRTSREKTKKVEAAIEKL
EQRYLRNVSPAEIAEELGMTVQDVVSTMNEGFFANLLSIDEKLHDQDDGENIQVMIRDDK
NVPPEEKIMKDELIAQLAEKIHELSEKEQLVVSLFYKEELTLTEIGQVLNLSTSRISQIHSKA
LFKLKNLLEKVIQ (SEQ ID NO:57). 

Additionally, the coding region is found at about 1715786-1716547 bp of the B. 
subtilis 168 chromosome. 

As indicated above, it is contemplated that inactivated analogous genes found in 
other Bacillus hosts will find use in the present invention. 

In some preferred embodiments, the host cell is a member of the genus Bacillus,
while in some embodiments, the Bacillus strain of interest is alkalophilic. Numerous
alkalophilic Bacillus strains are known (See e.g., U.S. Pat. 5,217,878; and Aunstrup et al.,
Proc IV IFS: Ferment. Technol. Today, 299-305 [1972]). In some preferred embodiments,
the Bacillus strain of interest is an industrial Bacillus strain. Examples of industrial Bacillus
strains include, but are not limited to, B. licheniformis, B. lentus, B. subtilis, and B.
amyloliquefaciens. In additional embodiments, the Bacillus host strain is selected from the
group consisting of B. lentus, B. brevis, B. stearothermophilus, B. alkalophilus, B.
coagulans, B. circulans, B. pumilus, B. thuringiensis, B. clausii, and B. megaterium, as
well as other organisms within the genus Bacillus, as discussed above. In some
particularly preferred embodiments, B. subtilis is used. For example, U.S. Patents
5,264,366 and 4,760,025 (RE 34,606) describe various Bacillus host strains that find use
in the present invention, although other suitable strains are contemplated for use in the
present invention.

An industrial strain may be a non-recombinant strain of a Bacillus sp., a mutant of
a naturally occurring strain or a recombinant strain. Preferably, the host strain is a
recombinant host strain wherein a polynucleotide encoding a polypeptide of interest has
been introduced into the host. A further preferred host strain is a Bacillus subtilis host
strain and particularly a recombinant Bacillus subtilis host strain. Numerous B. subtilis
strains are known, including but not limited to 1A6 (ATCC 39085), 168 (1A01), SB19,
W23, Ts85, B637, PB1753 through PB1758, PB3360, JH642, 1A243 (ATCC 39,087),
ATCC 21332, ATCC 6051, MI113, DE100 (ATCC 39,094), GX4931, PBT 110, and PEP
211 strain (See e.g., Hoch et al., Genetics, 73:215-228 [1973]; U.S. Patent No. 4,450,235;
U.S. Patent No. 4,302,544; and EP 0134048). The use of B. subtilis as an expression
host is further described by Palva et al. and others (See, Palva et al., Gene 19:81-87
[1982]; also see Fahnestock and Fischer, J. Bacteriol., 165:796-804 [1986]; and Wang et
al., Gene 69:39-47 [1988]).

Industrial protease producing Bacillus strains provide particularly preferred
expression hosts. In some preferred embodiments, use of these strains in the present
invention provides further enhancements in efficiency and protease production. Two
general types of proteases are typically secreted by Bacillus sp., namely neutral (or
"metalloproteases") and alkaline (or "serine") proteases. Serine proteases are enzymes
which catalyze the hydrolysis of peptide bonds in which there is an essential serine
residue at the active site. Serine proteases have molecular weights in the 25,000 to
30,000 range (See, Priest, Bacteriol. Rev., 41:711-753 [1977]). Subtilisin is a preferred
serine protease for use in the present invention. A wide variety of Bacillus subtilisins have
been identified and sequenced, for example, subtilisin 168, subtilisin BPN', subtilisin
Carlsberg, subtilisin DY, subtilisin 147 and subtilisin 309 (See e.g., EP 414279 B; WO
89/06279; and Stahl et al., J. Bacteriol., 159:811-818 [1984]). In some embodiments of
the present invention, the Bacillus host strains produce mutant (e.g., variant) proteases.
Numerous references provide examples of variant proteases (See e.g., WO
99/20770; WO 99/20726; WO 99/20769; WO 89/06279; RE 34,606; U.S. Patent No.
4,914,031; U.S. Patent No. 4,980,288; U.S. Patent No. 5,208,158; U.S. Patent No.
5,310,675; U.S. Patent No. 5,336,611; U.S. Patent No. 5,399,283; U.S. Patent No.
5,441,882; U.S. Patent No. 5,482,849; U.S. Patent No. 5,631,217; U.S. Patent No.
5,665,587; U.S. Patent No. 5,700,676; U.S. Patent No. 5,741,694; U.S. Patent No.
5,858,757; U.S. Patent No. 5,880,080; U.S. Patent No. 6,197,567; and U.S. Patent No.
6,218,165).

In yet another embodiment, a preferred Bacillus host is a Bacillus sp. that includes
a mutation or deletion in at least one of the following genes: degU, degS, degR and degQ.
Preferably the mutation is in a degU gene, and more preferably the mutation is
degU32(Hy) (See, Msadek et al., J. Bacteriol., 172:824-834 [1990]; and Olmos et al.,
Mol. Gen. Genet., 253:562-567 [1997]). A most preferred host strain is a Bacillus subtilis
carrying a degU32(Hy) mutation. In a further embodiment, the Bacillus host comprises a
mutation or deletion in scoC4 (See, Caldwell et al., J. Bacteriol., 183:7329-7340 [2001]);
spoIIE (See, Arigoni et al., Mol. Microbiol., 31:1407-1415 [1999]); oppA or other genes of
the opp operon (See, Perego et al., Mol. Microbiol., 5:173-185 [1991]). Indeed, it is
contemplated that any mutation in the opp operon that causes the same phenotype as a
mutation in the oppA gene will find use in some embodiments of the altered Bacillus strain
of the present invention. In some embodiments, these mutations occur alone, while in
other embodiments, combinations of mutations are present. In some embodiments, an
altered Bacillus of the invention is obtained from a Bacillus host strain that already
includes a mutation to one or more of the above-mentioned genes. In alternate
embodiments, an altered Bacillus of the invention is further engineered to include mutation
of one or more of the above-mentioned genes.

In yet another embodiment, the incoming sequence comprises a selective marker
located between two loxP sites (See, Kuhn and Torres, Meth. Mol. Biol., 180:175-204
[2002]), and the antimicrobial marker is then deleted by the action of Cre protein. In some
embodiments, this results in the insertion of a single loxP site, as well as a deletion of
native DNA, as determined by the primers used to construct homologous flanking DNA
and antimicrobial-containing incoming DNA.
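
The outcome of Cre-mediated marker removal can be pictured with a short string-based sketch. This is an illustration only: the loxP sequence used is the commonly cited 34 bp site, and the function name `cre_excise` and the toy flanking and marker sequences are hypothetical. The sketch assumes two loxP sites in the same orientation on one strand.

```python
# Commonly cited 34-bp loxP site: two 13-bp inverted repeats around an 8-bp spacer.
LOXP = "ATAACTTCGTATAGCATACATTATACGAAGTTAT"


def cre_excise(seq: str, site: str = LOXP) -> str:
    """Remove the DNA between two directly repeated sites, leaving one site.

    If fewer than two sites are found, the sequence is returned unchanged.
    """
    first = seq.find(site)
    if first == -1:
        return seq
    second = seq.find(site, first + len(site))
    if second == -1:
        return seq
    # Keep everything up to and including the first site, then resume
    # immediately after the second site.
    return seq[:first + len(site)] + seq[second + len(site):]


# Toy example: upstream flank, loxP-marker-loxP cassette, downstream flank.
upstream, marker, downstream = "GATTACA", "TTTTCCCC", "CATCAT"
integrated = upstream + LOXP + marker + LOXP + downstream
print(cre_excise(integrated) == upstream + LOXP + downstream)  # True
```

The remaining single site between the two flanks corresponds to the single loxP insertion described above.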

Those of skill in the art are well aware of suitable methods for introducing
polynucleotide sequences into Bacillus cells (See e.g., Ferrari et al., "Genetics," in
Harwood et al. (ed.), Bacillus, Plenum Publishing Corp. [1989], pages 67-72; See also,
Saunders et al., J. Bacteriol., 157:718-726 [1984]; Hoch et al., J. Bacteriol., 93:1925-1937
[1967]; Mann et al., Current Microbiol., 13:131-135 [1986]; and Holubova, Folia Microbiol.,
30:97 [1985]; for B. subtilis, Chang et al., Mol. Gen. Genet., 168:11-115 [1979]; for B.
megaterium, Vorobjeva et al., FEMS Microbiol. Lett., 7:261-263 [1980]; for B.
amyloliquefaciens, Smith et al., Appl. Env. Microbiol., 51:634 [1986]; for B. thuringiensis,
Fisher et al., Arch. Microbiol., 139:213-217 [1981]; and for B. sphaericus, McDonald, J.
Gen. Microbiol., 130:203 [1984]). Indeed, such methods as transformation including
protoplast transformation and congression, transduction, and protoplast fusion are known
and suited for use in the present invention. Methods of transformation are particularly
preferred to introduce a DNA construct provided by the present invention into a host cell.

In addition to commonly used methods, in some embodiments, host cells are
directly transformed (i.e., an intermediate cell is not used to amplify, or otherwise process,
the DNA construct prior to introduction into the host cell). Introduction of the DNA
construct into the host cell includes those physical and chemical methods known in the art
to introduce DNA into a host cell without insertion into a plasmid or vector. Such methods
include, but are not limited to calcium chloride precipitation, electroporation, naked DNA,
liposomes and the like. In additional embodiments, DNA constructs are co-transformed
with a plasmid, without being inserted into the plasmid. In further embodiments, a
selective marker is deleted from the altered Bacillus strain by methods known in the art
(See, Stahl et al., J. Bacteriol., 158:411-418 [1984]; and Palmeros et al., Gene 247:255-264
[2000]).

In some embodiments, host cells are transformed with one or more DNA
constructs according to the present invention to produce an altered Bacillus strain wherein
two or more genes have been inactivated in the host cell. In some embodiments, two or
more genes are deleted from the host cell chromosome. In alternative embodiments, two
or more genes are inactivated by insertion of a DNA construct. In some embodiments, the
inactivated genes are contiguous (whether inactivated by deletion and/or insertion), while
in other embodiments, they are not contiguous genes.

There are various assays known to those of ordinary skill in the art for detecting and
measuring activity of intracellularly and extracellularly expressed polypeptides. In particular,
for proteases, there are assays based on the release of acid-soluble peptides from casein or
hemoglobin measured as absorbance at 280 nm or colorimetrically using the Folin method
(See e.g., Bergmeyer et al., "Methods of Enzymatic Analysis" vol. 5, Peptidases, Proteinases
and their Inhibitors, Verlag Chemie, Weinheim [1984]). Other assays involve the
solubilization of chromogenic substrates (See e.g., Ward, "Proteinases," in Fogarty (ed.),
Microbial Enzymes and Biotechnology, Applied Science, London, [1983], pp 251-317).
Other exemplary assays include the succinyl-Ala-Ala-Pro-Phe-para nitroanilide assay
(SAAPFpNA) and the 2,4,6-trinitrobenzene sulfonate sodium salt assay (TNBS assay).
Numerous additional references known to those in the art provide suitable methods (See
e.g., Wells et al., Nucleic Acids Res. 11:7911-7925 [1983]; Christianson et al., Anal.
Biochem., 223:119-129 [1994]; and Hsia et al., Anal. Biochem., 242:221-227 [1999]).

Means for determining the levels of secretion of a protein of interest in a host cell and
detecting expressed proteins include the use of immunoassays with either polyclonal or
monoclonal antibodies specific for the protein. Examples include enzyme-linked
immunosorbent assay (ELISA), radioimmunoassay (RIA), fluorescence immunoassay (FIA),
and fluorescent activated cell sorting (FACS). However, other methods are known to those in
the art and find use in assessing the protein of interest (See e.g., Hampton et al., Serological
Methods, A Laboratory Manual, APS Press, St. Paul, MN [1990]; and Maddox et al., J. Exp.
Med., 158:1211 [1983]). In some preferred embodiments, secretion of a protein of interest is
higher in the altered strain obtained using the present invention than in a corresponding
unaltered host. As known in the art, the altered Bacillus cells produced using the present
invention are maintained and grown under conditions suitable for the expression and
recovery of a polypeptide of interest from cell culture (See e.g., Harwood and Cutting (eds.)
Molecular Biological Methods for Bacillus, John Wiley & Sons [1990]).

B. Large Chromosomal Deletions

As indicated above, in addition to single and multiple gene deletions, the present
invention provides large chromosomal deletions. In some preferred embodiments of the
present invention, an indigenous chromosomal region or fragment thereof is deleted from
a Bacillus host cell to produce an altered Bacillus strain. In some embodiments, the
indigenous chromosomal region includes prophage regions, antimicrobial regions (e.g.,
antibiotic regions), regulator regions, multi-contiguous single gene regions and/or operon
regions. The coordinates delineating indigenous chromosomal regions referred to herein
are specified according to the Bacillus subtilis strain 168 chromosome map. Numbers
generally relate to the beginning of the ribosomal binding site, if present, or the end of the
coding region, and generally do not include a terminator that might be present. The
Bacillus subtilis genome of strain 168 is well known (See, Kunst et al., Nature 390:249-
256 [1997]; and Henner et al., Microbiol. Rev., 44:57-82 [1980]), and is comprised of one
4215 kb chromosome. However, the present invention also includes analogous
sequences from any Bacillus strain. Particularly preferred are other B. subtilis strains, B.
licheniformis strains and B. amyloliquefaciens strains.

In some embodiments, the indigenous chromosomal region includes prophage
segments and fragments thereof. A "prophage segment" is viral DNA that has been
inserted into the bacterial chromosome wherein the viral DNA is effectively
indistinguishable from normal bacterial genes. The B. subtilis genome is comprised of
numerous prophage segments; these segments are not infective (Seaman et al.,
Biochem., 3:607-613 [1964]; and Stickler et al., Virol., 26:142-145 [1965]). Although any
one of the Bacillus subtilis prophage regions may be deleted, reference is made to the
following non-limiting examples.

One prophage region that is deleted in some embodiments of the present invention
is a Sigma K intervening "skin" element. This region is found at about 2652600 bp
(spoIVCA) to 2700579 bp (yqaS) of the B. subtilis 168 chromosome. Using the present
invention, about a 46 kb segment was deleted, corresponding to 2653562 bp to 2699604
bp of the chromosome. This element is believed to be a remnant of an ancestral
temperate phage which is positioned within the sigK ORF, between the genes spoIVCB and
spoIIIC. However, it is not intended that the present invention be limited to any particular
mechanism or mode of action involving the deleted region. The element has been shown
to contain 57 open reading frames with putative ribosome binding sites (See, Takemaru et
al., Microbiol., 141:323-327 [1995]). During spore formation in the mother cell, the skin
element is excised, leading to the reconstruction of the sigK gene.

Another region suitable for deletion is a prophage 7 region. This region is found at
about 2701208 bp (yrkS) to 2749572 bp (yraK) of the B. subtilis 168 chromosome. Using
the present invention, about a 48.5 kb segment was deleted, corresponding to 2701087 bp
to 2749642 bp of the chromosome.

A further region is a skin + prophage 7 region. This region is found at about
2652151 bp to 2749642 bp of the B. subtilis 168 chromosome. Using the present
invention, a segment of about 97.5 kb was deleted. This region also includes the
intervening spoIIIC gene. The skin/prophage 7 region includes but is not limited to the
following genes: spoIVCA (DNA recombinase), blt (multidrug resistance), cypA (cytochrome
P450-like enzyme), czcD (cation-efflux system membrane protein), and rapE (response
regulator aspartate phosphatase).

Yet another region is the PBSX region. This region is found at about 1319884 bp
(xkdA) to 1347491 bp (xlyA) of the B. subtilis 168 chromosome. Using the present
invention, a segment of about 29 kb was deleted, corresponding to 1319663 to 1348691 bp
of the chromosome. Under normal non-induced conditions this prophage element is
non-infective and is not bactericidal (except for a few sensitive strains such as W23 and S31).
It is inducible with mitomycin C and activated by the SOS response and results in cell lysis
with the release of phage-like particles. The phage particles contain bacterial
chromosomal DNA and kill sensitive bacteria without injecting DNA (Canosi et al., J. Gen.
Virol. 39:81-90 [1978]). This region includes the following non-limiting list of genes: xtmA-B;
xkdA-K and M-X, xre, xtrA, xpf, xep, xhlA-B and xlyA.

A further region is the SPβ region. This region is found at about 2150824 bp
(yodU) to 2286246 bp (ypqP) of the B. subtilis 168 chromosome. Using the present
invention, a segment of about 133.5 kb was deleted, corresponding to 2151827 to
2285246 bp of the chromosome. This element is a temperate prophage whose function
has not yet been characterized. However, genes in this region include putative spore coat
proteins (yodU, sspC, yokH), putative stress response proteins (yorD, yppQ, ypnP) and
other genes that have homology to genes in the spore coat protein and stress response
genes such as members of the yom operon. Other genes in this region include: yot, yos,
yoq, yop, yon, yom, yoz, yoi, yok, ypo, and ypm.

An additional region is the prophage 1 region. This region is found at about
202098 bp (ybbU) to 220015 bp (ybdE) of the B. subtilis 168 chromosome. Using the
present invention, a segment of about 18.0 kb was deleted, corresponding to 202112 to
220141 bp of the chromosome. Genes in this region include the AdaA/B operon which
provides an adaptive response to DNA alkylation and ndhF which codes for NADH
dehydrogenase, subunit 5.

A further region is the prophage 2 region. This region is found at about 529069 bp
(ydcL) to 569493 bp (ydeJ) of the B. subtilis 168 chromosome. Using the present
invention, a segment of about 40.5 kb was deleted, corresponding to 529067 to 569578 bp
of the chromosome. Genes in this region include rapI/phrI (response regulator aspartate
phosphatase), sacV (transcriptional regulator of the levansucrase) and cspC.

Another region is the prophage 3 region. Using the present invention, a segment
of about 50.7 kb was deleted, corresponding to about 652000 to 664300 bp of
the B. subtilis 168 chromosome.

Yet another region is the prophage 4 region. This region is found at about
1263017 bp (yjcM) to 1313627 bp (yjoA) of the B. subtilis 168 chromosome. Using the
present invention, a segment of about 2.3 kb was deleted, corresponding to 1262987 to
1313692 bp of the chromosome.

An additional region is the prophage 5 region. Using the present invention, a
segment of about 20.8 kb was deleted, corresponding to about 1879200 to
1900000 bp of the B. subtilis 168 chromosome.

Another region is the prophage 6 region. Using the present invention, a segment
of about 31.9 kb was deleted, corresponding to about 2046050 to 2078000 bp
in the B. subtilis 168 chromosome.
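
The approximate segment sizes quoted for the deleted regions follow from their chromosomal coordinates. The sketch below recomputes the span for four of the regions described above; it assumes, for illustration only, that the coordinates are 1-based and inclusive, and the helper name `span_kb` is hypothetical.

```python
def span_kb(start_bp: int, end_bp: int) -> float:
    """Length in kb of a segment given inclusive start and end coordinates."""
    if end_bp < start_bp:
        raise ValueError("end coordinate must not precede start coordinate")
    return (end_bp - start_bp + 1) / 1000.0


# Deleted segments (B. subtilis 168 coordinates) taken from the text above.
deleted_segments = {
    "skin": (2653562, 2699604),
    "PBSX": (1319663, 1348691),
    "prophage 1": (202112, 220141),
    "prophage 2": (529067, 569578),
}

for name, (start, end) in deleted_segments.items():
    print(f"{name}: about {span_kb(start, end):.1f} kb")
```

Because the stated sizes are given as "about" a value, small differences between a recomputed span and the quoted figure are to be expected.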

In further embodiments, the indigenous chromosomal region includes one or more 
operon regions, multi-contiguous single gene regions, and/or anti-microbial regions. In 
some embodiments, these regions include the following: 

1) The PPS operon region:

This region is found at about 1959410 bp (ppsE) to 1997178 bp (ppsA) of
the Bacillus subtilis 168 chromosome. Using the present invention, a segment of
about 38.6 kb was deleted, corresponding to about 1960409 to 1998026 bp of the
chromosome. This operon region is involved in antimicrobial synthesis and
encodes plipastatin synthetase;

2) The PKS operon region:

This region is found at about 1781110 bp (pksA) to 1857712 bp (pksR) of
the B. subtilis 168 chromosome. Using the present invention, a segment of about
76.2 kb was deleted, corresponding to about 1781795 to 1857985 bp of the
chromosome. This region encodes polyketide synthase and is involved in
anti-microbial synthesis (Scotti et al., Gene, 130:65-71 [1993]);

3) The yvfF-yveK operon region:

This region is found at about 3513149 bp (yvfF) to 3528184 bp (yveK) of
the B. subtilis 168 chromosome. Using the present invention, a segment of about
15.8 kb was deleted, corresponding to about 3513137 to 3528896 bp of the
chromosome. This region codes for a putative polysaccharide (See, Dartois et al.,
Seventh International Conference on Bacillus (1993) Institute Pasteur [1993], page
56). This region includes the following genes: yvfA-F, yveK-T and slr. The slr gene
region which is found at about 3529014-3529603 bp of the B. subtilis 168
chromosome encompasses about a 589 bp segment. This region is the regulator
region of the yvfF-yveK operon;

4) The DHB operon region:

This region is found at about 3279750 bp (yukL) to 3293206 bp (yuiH) of
the B. subtilis 168 chromosome. Using the present invention, a segment of about
13.0 kb was deleted, corresponding to 3279418-3292920 bp of the chromosome.
This region encodes the biosynthetic template for the catecholic siderophore 2,3-
dihydroxy benzoate-glycine-threonine trimeric ester bacillibactin (See, May et al.,
J. Biol. Chem., 276:7209-7217 [2001]). This region includes the following genes:
yukL, yukM, dhbA-C, E and F, and yuiI-H.

While the regions, as described above, are examples of preferred indigenous
chromosomal regions to be deleted, in some embodiments of the present invention, a
fragment of the region is also deleted. In some embodiments, such fragments include a
range of about 1% to 99% of the indigenous chromosomal region. In other embodiments,
fragments include a range of about 5% to 95% of the indigenous chromosomal region. In
yet additional embodiments, fragments comprise at least 99%, 98%, 97%, 96%, 95%,
94%, 93%, 92%, 90%, 88%, 85%, 80%, 75%, 70%, 65%, 50%, 40%, 30%, 25%, 20% and
10% of the indigenous chromosomal region.

Further non-limiting examples of fragments of indigenous chromosomal regions to 
be deleted with reference to the chromosomal location in the B. subtilis 168 chromosome 
include the following: 

a) for the skin region:

i) a coordinate location of about 2666663 to 2693807, which includes
yqcC to yqaM, and

ii) a coordinate location of about 2658440 to 2659688, which includes
rapE to phrE;

b) for the PBSX prophage region:

i) a coordinate location of about 1320043 to 1345263, which includes
xkdA to xkdX, and

ii) a coordinate location of about 1326662 to 1345102, which includes
xkdE to xkdW;

c) for the SPβ region:

i) a coordinate location of about 2149354 to 2237029, which includes
yodV to yonA;

d) for the DHB region:

i) a coordinate location of about 3282879 to 3291353, which includes
dhbF to dhbA;

e) for the yvfF-yveK region:

i) a coordinate location of about 3516549 to 3522333, which includes yvfB
to yveQ,

ii) a coordinate location of about 3513181 to 3528915, which includes yvfF
to yveK, and

iii) a coordinate location of about 3521233 to 3528205, which includes
yveQ to yveL;

f) for the prophage 1 region:

i) a coordinate location of about 213926 to 220015, which includes ybcO
to ybdE, and

ii) a coordinate location of about 214146 to 220015, which includes ybcP
to ybdE;

g) for the prophage 2 region:

i) a coordinate location of about 546867 to 559005, which includes rapI to
cspC; and

h) for the prophage 4 region:

i) a coordinate location of about 1263017 to 675421, which includes yjcM
to ydjJ.

The fragments of indigenous chromosomal regions which are suitable
for deletion are numerous, because a fragment may be comprised of only a few bps less
than the identified indigenous chromosomal region. Furthermore, many of the identified
indigenous chromosomal regions encompass a large number of genes. Those of skill in
the art are capable of easily determining which fragments of the indigenous chromosomal
regions are suitable for deletion for use in a particular application.
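
To relate a fragment to the percentage ranges given above, the fraction of an indigenous chromosomal region covered by a fragment can be computed from the two coordinate pairs. The following is a minimal sketch under the assumption of 1-based, inclusive coordinates; the function name `fragment_percent` is hypothetical, and the example uses the PBSX region and its first listed fragment.

```python
def fragment_percent(fragment: tuple[int, int], region: tuple[int, int]) -> float:
    """Percentage of `region` covered by `fragment` (inclusive coordinates).

    Only the part of the fragment that lies inside the region is counted.
    """
    f_start, f_end = fragment
    r_start, r_end = region
    overlap_start = max(f_start, r_start)
    overlap_end = min(f_end, r_end)
    overlap_bp = max(0, overlap_end - overlap_start + 1)
    region_bp = r_end - r_start + 1
    return 100.0 * overlap_bp / region_bp


pbsx_region = (1319884, 1347491)    # xkdA to xlyA
pbsx_fragment = (1320043, 1345263)  # xkdA to xkdX
print(f"{fragment_percent(pbsx_fragment, pbsx_region):.1f}% of the PBSX region")
```

A fragment computed in this way can then be compared against the "about 1% to 99%" or "about 5% to 95%" ranges recited for fragments.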

The definition of an indigenous chromosomal region is not so strict as to exclude a
number of adjacent nucleotides to the defined segment. For example, while the SPβ
region is defined herein as located at coordinates 2150824 to 2286246 of the B. subtilis
168 chromosome, an indigenous chromosomal region may include a further 10 to 5000
bp, a further 100 to 4000 bp, or a further 100 to 1000 bp on either side of the region. The
number of bp on either side of the region is limited by the presence of another gene not
included in the indigenous chromosomal region targeted for deletion.
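
The effect of allowing additional adjacent nucleotides, limited by neighboring genes, can be sketched as a simple coordinate calculation. In the example below the function name `extended_region` and the neighbor-gene limits are hypothetical values chosen for illustration; only the SPβ coordinates come from the text.

```python
from typing import Optional


def extended_region(
    start_bp: int,
    end_bp: int,
    margin_bp: int,
    left_limit: Optional[int] = None,
    right_limit: Optional[int] = None,
) -> tuple[int, int]:
    """Extend a region by `margin_bp` on each side.

    `left_limit` and `right_limit` are the outermost coordinates that may be
    included without running into a neighboring gene that is not targeted
    for deletion; the extension is clamped to them when they are given.
    """
    new_start = start_bp - margin_bp
    new_end = end_bp + margin_bp
    if left_limit is not None:
        new_start = max(new_start, left_limit)
    if right_limit is not None:
        new_end = min(new_end, right_limit)
    return new_start, new_end


# SPβ region extended by 1000 bp on either side, with no neighbor constraint.
print(extended_region(2150824, 2286246, 1000))  # (2149824, 2287246)

# Same region with a 5000 bp margin, clamped by hypothetical neighbor limits.
print(extended_region(2150824, 2286246, 5000,
                      left_limit=2149000, right_limit=2288000))
# (2149000, 2288000)
```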

As stated above, the locations of specified regions herein disclosed are in reference
to the B. subtilis 168 chromosome. Other analogous regions from Bacillus strains are
included in the definition of an indigenous chromosomal region. While the analogous
region may be found in any Bacillus strain, particularly preferred analogous regions are
regions found in other Bacillus subtilis strains, Bacillus licheniformis strains and Bacillus
amyloliquefaciens strains.

In certain embodiments, more than one indigenous chromosomal region or 
fragment thereof is deleted from a Bacillus strain. However, the deletion of one or more 
indigenous chromosomal regions or fragments thereof does not deleteriously affect 
reproductive viability of the strain which includes the deletion. In some embodiments, two 
indigenous chromosomal regions or fragments thereof are deleted. In additional 
embodiments, three indigenous chromosomal regions or fragments thereof are deleted. In 
yet another embodiment, four indigenous chromosomal regions or fragments thereof are 
deleted. In a further embodiment, five indigenous chromosomal regions or fragments 
thereof are deleted. In another embodiment, as many as 14 indigenous chromosomal 
regions or fragments thereof are deleted. In some embodiments, the indigenous 
chromosomal regions or fragments thereof are contiguous, while in other embodiments, 
they are located on separate regions of the Bacillus chromosome. 

A strain of any member of the genus Bacillus comprising a deleted indigenous
chromosomal region or fragment thereof finds use in the present invention. In some
preferred embodiments, the Bacillus strain is selected from the group consisting of B.
subtilis strains, B. amyloliquefaciens strains, B. lentus strains, and B. licheniformis strains.
In some preferred embodiments, the strain is an industrial Bacillus strain, and most
preferably an industrial B. subtilis strain. In a further preferred embodiment, the altered
Bacillus strain is a protease-producing strain. In some particularly preferred embodiments,
it is a B. subtilis strain that has been previously engineered to include a polynucleotide
encoding a protease enzyme.

As indicated above, a Bacillus strain in which an indigenous chromosomal region
or fragment thereof has been deleted is referred to herein as "an altered Bacillus strain." In
preferred embodiments of the present invention, the altered Bacillus strain has an
enhanced level of expression of a protein of interest (i.e., the expression of the protein of
interest is enhanced, compared to a corresponding unaltered Bacillus strain grown under
the same growth conditions).

One measure of enhancement is the secretion of the protein of interest. In some
embodiments, production of the protein of interest is enhanced by at least 0.5%, 1.0%,
1.5%, 2.0%, 2.5%, 3.0%, 4.0%, 5.0%, 8.0%, 10%, 15%, 20% and 25% or more, compared
to the corresponding unaltered Bacillus strain. In other embodiments, production of the
protein of interest is enhanced by between about 0.25% to 20%; 0.5% to 15% and 1.0%
to 10%, compared to the corresponding unaltered Bacillus strain as measured in grams of
protein produced per liter.
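
As a worked illustration of the comparison described above, the percentage enhancement can be calculated from the titers (grams of protein per liter) of the altered and unaltered strains grown under the same conditions. The function name and the titer values below are hypothetical and are not measured results.

```python
def percent_enhancement(altered_g_per_l: float, unaltered_g_per_l: float) -> float:
    """Percentage increase in production of the altered strain over the unaltered strain."""
    if unaltered_g_per_l <= 0:
        raise ValueError("unaltered titer must be positive")
    return 100.0 * (altered_g_per_l - unaltered_g_per_l) / unaltered_g_per_l


# Hypothetical titers: 2.0 g/L for the unaltered strain, 2.2 g/L for the altered strain.
print(f"{percent_enhancement(2.2, 2.0):.1f}% enhancement")  # 10.0% enhancement
```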

The altered Bacillus strains provided by the present invention comprising a deletion
of an indigenous chromosomal region or fragment thereof are produced using any suitable
methods, including but not limited to the following means. In one general embodiment, a
DNA construct is introduced into a Bacillus host. The DNA construct comprises an
inactivating chromosomal segment, and in some embodiments, further comprises a
selective marker. Preferably, the selective marker is flanked on both the 5' and 3' ends by
one section of the inactivating chromosomal segment.

In some embodiments, the inactivating chromosomal segment, while preferably
having 100% sequence identity to the immediate upstream and downstream nucleotides
of an indigenous chromosomal region to be deleted (or a fragment of said region), has
between about 70 to 100%, about 80 to 100%, about 90 to 100%, and about 95 to 100%
sequence identity to the upstream and downstream nucleotides of the indigenous
chromosomal region. Each section of the inactivating chromosomal segment must include
sufficient 5' and 3' flanking sequences of the indigenous chromosomal region to provide
for homologous recombination with the indigenous chromosomal region in the unaltered
host.
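
Percent sequence identity between a flanking section and the corresponding chromosomal sequence can be illustrated with a simple position-by-position comparison. This sketch assumes the two sequences are already aligned and of equal length (no gaps); it is not the alignment method of any particular program, and the function name and toy sequences are hypothetical.

```python
def percent_identity(seq_a: str, seq_b: str) -> float:
    """Percentage of identical positions between two equal-length, pre-aligned sequences."""
    if len(seq_a) != len(seq_b):
        raise ValueError("sequences must be aligned to the same length")
    if not seq_a:
        raise ValueError("sequences must not be empty")
    matches = sum(1 for a, b in zip(seq_a.upper(), seq_b.upper()) if a == b)
    return 100.0 * matches / len(seq_a)


# Toy flanking arm with one mismatch in ten positions.
chromosomal_flank = "ACGTACGTAC"
construct_flank = "ACGTACGTAA"
print(percent_identity(chromosomal_flank, construct_flank))  # 90.0
```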

In some embodiments, each section of the inactivating chromosomal segment
comprises about 50 to 10,000 base pairs (bp). However, lower or higher bp sections find
use in the present invention. Preferably, each section is about 50 to 5000 bp, about 100
to 5000 bp, about 100 to 3000 bp; 100 to 2000 bp; about 100 to 1000 bp; about 200 to
4000 bp, about 400 to 3000 bp, about 500 to 2000 bp, and also about 800 to 1500 bp.
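
The arrangement of a selective marker flanked by two sections of the inactivating chromosomal segment, and the result of replacing the targeted region by homologous recombination, can be modeled on plain strings. The sketch below is a toy model only: the function names, the use of 1-based inclusive coordinates, and the short sequences are assumptions made for illustration.

```python
def build_deletion_construct(
    chromosome: str, del_start: int, del_end: int, arm_len: int, marker: str
) -> str:
    """Return 5' flanking arm + marker + 3' flanking arm.

    `del_start` and `del_end` are 1-based, inclusive coordinates of the
    region to be deleted; each arm is `arm_len` bp taken immediately
    upstream or downstream of that region.
    """
    if not 1 <= del_start <= del_end <= len(chromosome):
        raise ValueError("deletion coordinates lie outside the chromosome")
    if del_start - 1 < arm_len or len(chromosome) - del_end < arm_len:
        raise ValueError("not enough flanking sequence for the requested arm length")
    five_prime_arm = chromosome[del_start - 1 - arm_len:del_start - 1]
    three_prime_arm = chromosome[del_end:del_end + arm_len]
    return five_prime_arm + marker + three_prime_arm


def apply_double_crossover(
    chromosome: str, del_start: int, del_end: int, marker: str
) -> str:
    """Model the recombinant chromosome: the targeted region is replaced by the marker."""
    return chromosome[:del_start - 1] + marker + chromosome[del_end:]


# Toy chromosome: 20 bp upstream, a 10 bp region to delete, 20 bp downstream.
toy = "A" * 20 + "G" * 10 + "T" * 20
construct = build_deletion_construct(toy, 21, 30, arm_len=5, marker="CCC")
print(construct)                                   # AAAAACCCTTTTT
print(apply_double_crossover(toy, 21, 30, "CCC"))  # region replaced by marker
```

In practice the arm length would fall within the ranges recited above (for example, about 800 to 1500 bp) rather than the 5 bp used in this toy example.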

In some embodiments, a DNA construct comprising a selective marker and an
inactivating chromosomal segment is assembled in vitro, followed by direct cloning of said
construct into a competent Bacillus host, such that the DNA construct becomes integrated
into the Bacillus chromosome. For example, PCR fusion and/or ligation are suitable for
assembling a DNA construct in vitro. In some embodiments, the DNA construct is a
non-plasmid construct, while in other embodiments, it is incorporated into a vector (i.e., a
plasmid). In some embodiments, a circular plasmid is used, and the circular plasmid is
cut using an appropriate restriction enzyme (i.e., one that does not disrupt the DNA
construct). Thus, linear plasmids find use in the present invention (See e.g., Figure 1; and
Perego, "Integrational Vectors for Genetic Manipulation in Bacillus subtilis," in Bacillus
subtilis and other Gram-Positive Bacteria, Sonenshein et al., Eds., Am. Soc. Microbiol.,
Washington, DC [1993]).

In some embodiments, a DNA construct or vector, preferably a plasmid including
an inactivating chromosomal segment, includes a sufficient amount of the 5' and 3' flanking
sequences (seq) of the indigenous chromosomal segment or fragment thereof to provide
for homologous recombination with the indigenous chromosomal region or fragment
thereof in the unaltered host. In another embodiment, the DNA construct includes
restriction sites engineered at upstream and downstream ends of the construct.
Non-limiting examples of DNA constructs useful according to the invention and identified
according to the coordinate location include:

1. A DNA construct for deleting a PBSX region: [5' flanking seq 1318874 -
1319860 bp which includes the end of yjqB and the entire yjpC including the ribosome
binding site (RBS)] - marker gene - [3' flanking seq 1348691 - 1349656 bp which includes a
terminator and upstream section of the pit].

2. A DNA construct for deleting a prophage 1 region: [5' flanking seq 201248 -
202112 bp which contains the entire glmS including the RBS and terminator and the ybbU
RBS] - marker gene - [3' flanking seq 220141 - 221195 bp which includes the entire ybgd
including the RBS].

3. A DNA construct for deleting a prophage 2 region: [5' flanking seq 527925 -
529067 bp which contains the end of ydcK, the entire tRNAs as follows: trnS-Asn,
trnS-Ser, trnS-Glu, trnS-Gln, trnS-Lys, trnS-Leu1 and trnS-Leu2] - marker gene - [3' flanking seq
569578 - 571062 bp which contains the entire ydeK and upstream part of ycfeL].

4. A DNA construct for deleting a prophage 4 region: [5' flanking seq 1263127 -
1264270 bp which includes part of yjcM] - marker gene - [3' flanking seq 1313660 -
1314583 bp which contains part of yjoB including the RBS].

5. A DNA construct for deleting a yvfF-yveK region: [5' flanking seq 3512061 -
3513161 bp which includes part of sigL, the entire yvfG and the start of yvfF] - marker
gene - [3' flanking seq 3528896 - 3529810 bp which includes the entire slr and the start of
pnbA].

6. A DNA construct for deleting a DHB operon region: [5' flanking seq 3278457 -
3280255 which includes the end of ald including the terminator, the entire yuxI including
the RBS, the entire yukJ including the RBS and terminator and the end of yukL] - marker
gene - [3' flanking seq 3292919 - 3294076 which includes the end of yuiH including the
RBS, the entire yuiG including the RBS and terminator and the upstream end of yuiF
including the terminator].
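
The coordinate ranges above can be turned into flank lengths and an approximate deletion size. The sketch below is illustrative only: it assumes the coordinates are inclusive chromosome positions and that the deleted region is everything strictly between the two flanks, neither of which is stated explicitly in the text. The class and field names are hypothetical.

```python
from dataclasses import dataclass
from typing import Tuple


@dataclass(frozen=True)
class DeletionConstruct:
    name: str
    flank5: Tuple[int, int]  # (start, end) of the 5' flanking seq, assumed inclusive
    flank3: Tuple[int, int]  # (start, end) of the 3' flanking seq, assumed inclusive

    @staticmethod
    def span(segment: Tuple[int, int]) -> int:
        """Length in bp of an inclusive coordinate range."""
        start, end = segment
        return end - start + 1

    def deleted_bp(self) -> int:
        """Bases strictly between the 5' and 3' flanks (assumed deletion size)."""
        return self.flank3[0] - self.flank5[1] - 1


# Coordinates copied from constructs 1, 4 and 6 above.
CONSTRUCTS = [
    DeletionConstruct("PBSX", (1318874, 1319860), (1348691, 1349656)),
    DeletionConstruct("prophage 4", (1263127, 1264270), (1313660, 1314583)),
    DeletionConstruct("DHB operon", (3278457, 3280255), (3292919, 3294076)),
]

for c in CONSTRUCTS:
    print(c.name, c.span(c.flank5), c.span(c.flank3), c.deleted_bp())
```

On these assumptions every flank listed falls inside the "about 50 to 10,000 bp" range given earlier for each section of the inactivating chromosomal segment.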

Whether the DNA construct is incorporated into a vector or used without the
presence of plasmid DNA, it is introduced into a microorganism, preferably an E. coli cell
or a competent Bacillus cell.

Methods for introducing DNA into Bacillus cells involving plasmid constructs and
transformation of plasmids into E. coli are well known. The plasmids are subsequently
isolated from E. coli and transformed into Bacillus. However, it is not essential to use
intervening microorganisms such as E. coli, and in some embodiments, a DNA construct
or vector is directly introduced into a Bacillus host.

In a preferred embodiment, the host cell is a Bacillus sp. (See e.g., U.S. Patent No.
5,264,366, U.S. Patent No. 4,760,025, and RE 34,606). In some embodiments, the
Bacillus strain of interest is an alkalophilic Bacillus. Numerous alkalophilic Bacillus strains
are known (See e.g., U.S. Patent 5,217,878; and Aunstrup et al., Proc IV IFS: Ferment.
Tech. Today, 299-305 [1972]). Another type of Bacillus strain of particular interest is a cell
of an industrial Bacillus strain. Examples of industrial Bacillus strains include, but are not
limited to B. licheniformis, B. lentus, B. subtilis, and B. amyloliquefaciens. In additional
embodiments, the Bacillus host strain is selected from the group consisting of B.
licheniformis, B. subtilis, B. lentus, B. brevis, B. stearothermophilus, B. alkalophilus, B.
amyloliquefaciens, B. coagulans, B. circulans, B. pumilus, B. thuringiensis, B. clausii, and
B. megaterium. In particularly preferred embodiments, B. subtilis cells are used.

In some embodiments, the industrial host strains are selected from the group
consisting of non-recombinant strains of Bacillus sp., mutants of a naturally-occurring
Bacillus strain, and recombinant Bacillus host strains. Preferably, the host strain is a
recombinant host strain, wherein a polynucleotide encoding a polypeptide of interest has
been previously introduced into the host. A further preferred host strain is a Bacillus
subtilis host strain, and particularly a recombinant Bacillus subtilis host strain. Numerous
B. subtilis strains are known and suitable for use in the present invention (See e.g., 1A6
(ATCC 39085), 168 (1A01), SB19, W23, Ts85, B637, PB1753 through PB1758, PB3360,
JH642, 1A243 (ATCC 39,087), ATCC 21332, ATCC 6051, MI113, DE100 (ATCC 39,094),
GX4931, PBT 110, and PEP 211 strain; Hoch et al., Genetics, 73:215-228 [1973]; U.S.
Patent No. 4,450,235; U.S. Patent No. 4,302,544; EP 0134048; Palva et al., Gene, 19:81-
87 [1982]; Fahnestock and Fischer, J. Bacteriol., 165:796-804 [1986]; and Wang
et al., Gene 69:39-47 [1988]). Of particular interest as expression hosts are industrial
protease-producing Bacillus strains. By using these strains, the high efficiency seen for
production of the protease is further enhanced by the altered Bacillus strain of the present
invention.

Industrial protease producing Bacillus strains provide particularly preferred
expression hosts. In some preferred embodiments, use of these strains in the present
invention provides further enhancements in efficiency and protease production. As
indicated above, two general types of proteases are typically secreted by
Bacillus sp., namely neutral (or "metalloproteases") and alkaline (or "serine") proteases.
Also as indicated above, subtilisin is a preferred serine protease for use in the present
invention. A wide variety of Bacillus subtilisins have been identified and sequenced, for
example, subtilisin 168, subtilisin BPN', subtilisin Carlsberg, subtilisin DY, subtilisin 147
and subtilisin 309 (See e.g., EP 414279 B; WO 89/06279; and Stahl et al., J. Bacteriol.,
159:811-818 [1984]). In some embodiments of the present invention, the Bacillus host
strains produce mutant (e.g., variant) proteases. Numerous references provide examples
of variant proteases (See e.g., WO 99/20770; WO 99/20726; WO 99/20769;
WO 89/06279; RE 34,606; U.S. Patent No. 4,914,031; U.S. Patent No. 4,980,288; U.S.
Patent No. 5,208,158; U.S. Patent No. 5,310,675; U.S. Patent No. 5,336,611; U.S. Patent
No. 5,399,283; U.S. Patent No. 5,441,882; U.S. Patent No. 5,482,849; U.S. Patent No.
5,631,217; U.S. Patent No. 5,665,587; U.S. Patent No. 5,700,676; U.S. Patent No.
5,741,694; U.S. Patent No. 5,858,757; U.S. Patent No. 5,880,080; U.S. Patent No.
6,197,567; and U.S. Patent No. 6,218,165).

In yet another embodiment, a preferred Bacillus host is a Bacillus sp. that includes
a mutation or deletion in at least one of the following genes: degU, degS, degR and degQ.
Preferably the mutation is in a degU gene, and more preferably the mutation is
degU(Hy)32 (See, Msadek et al., J. Bacteriol., 172:824-834 [1990]; and Olmos et al.,
Mol. Gen. Genet. 253:562-567 [1997]). A most preferred host strain is a Bacillus subtilis
carrying a degU32(Hy) mutation. In a further embodiment, the Bacillus host comprises a
mutation or deletion in scoC4 (See, Caldwell et al., J. Bacteriol., 183:7329-7340 [2001]);
spoIIE (See, Arigoni et al., Mol. Microbiol., 31:1407-1415 [1999]); oppA or other genes of
the opp operon (See, Perego et al., Mol. Microbiol., 5:173-185 [1991]). Indeed, it is
contemplated that any mutation in the opp operon that causes the same phenotype as a
mutation in the oppA gene will find use in some embodiments of the altered Bacillus strain
of the present invention. In some embodiments, these mutations occur alone, while in
other embodiments, combinations of mutations are present. In some embodiments, an
altered Bacillus of the invention is obtained from a Bacillus host strain that already
includes a mutation in one or more of the above-mentioned genes. In alternate
embodiments, an altered Bacillus of the invention is further engineered to include a mutation
in one or more of the above-mentioned genes.

In some embodiments, two or more DNA constructs are introduced into a Bacillus
host cell, resulting in the deletion of two or more indigenous chromosomal regions in an
altered Bacillus. In some embodiments, these regions are contiguous (e.g., the skin plus
prophage 7 region), while in other embodiments, the regions are separated (e.g., the
PBSX region and the PKS region; the skin region and the DHB region; or the PKS region,
the SPβ region and the yvfF-yveK region).

Those of skill in the art are well aware of suitable methods for introducing
polynucleotide sequences into bacterial (e.g., E. coli and Bacillus) cells (See e.g., Ferrari
et al., "Genetics," in Harwood et al. (ed.), Bacillus, Plenum Publishing Corp. [1989], pages
57-72; See also, Saunders et al., J. Bacteriol., 157:718-726 [1984]; Hoch et al., J.
Bacteriol., 93:1925-1937 [1967]; Mann et al., Current Microbiol., 13:131-135 [1986]; and
Holubova, Folia Microbiol., 30:97 [1985]; for B. subtilis, Chang et al., Mol. Gen. Genet.,
168:11-115 [1979]; for B. megaterium, Vorobjeva et al., FEMS Microbiol. Lett., 7:261-263
[1980]; for B. amyloliquefaciens, Smith et al., Appl. Env. Microbiol., 51:634 [1986]; for B.
thuringiensis, Fisher et al., Arch. Microbiol., 139:213-217 [1981]; and for B. sphaericus,
McDonald, J. Gen. Microbiol., 130:203 [1984]). Indeed, such methods as transformation
including protoplast transformation and congression, transduction, and protoplast fusion
are known and suited for use in the present invention. Methods of transformation are
particularly preferred to introduce a DNA construct provided by the present invention into a
host cell.

In addition to commonly used methods, in some embodiments, host cells are
directly transformed (i.e., an intermediate cell is not used to amplify, or otherwise process,
the DNA construct prior to introduction into the host cell). Introduction of the DNA
construct into the host cell includes those physical and chemical methods known in the art
to introduce DNA into a host cell, without insertion into a plasmid or vector. Such methods
include but are not limited to calcium chloride precipitation, electroporation, naked DNA,
liposomes and the like. In additional embodiments, DNA constructs are co-transformed
with a plasmid without being inserted into the plasmid. In further embodiments, a
selective marker is deleted or substantially excised from the altered Bacillus strain by
methods known in the art (See, Stahl et al., J. Bacteriol., 158:411-418 [1984]; and the
conservative site-specific recombination [CSSR] method of Palmeros et al., described in
Palmeros et al., Gene 247:255-264 [2000]). In some preferred embodiments, resolution
of the vector from a host chromosome leaves the flanking regions in the chromosome
while removing the indigenous chromosomal region.

In some embodiments, host cells are transformed with one or more DNA
constructs according to the present invention to produce an altered Bacillus strain wherein
two or more genes have been inactivated in the host cell. In some embodiments, two or
more genes are deleted from the host cell chromosome. In alternative embodiments, two
or more genes are inactivated by insertion of a DNA construct. In some embodiments, the
inactivated genes are contiguous (whether inactivated by deletion and/or insertion), while
in other embodiments, they are not contiguous genes.

As indicated above, there are various assays known to those of ordinary skill in the
art for detecting and measuring activity of intracellularly and extracellularly expressed
polypeptides. In particular, for proteases, there are assays based on the release of
acid-soluble peptides from casein or hemoglobin measured as absorbance at 280 nm or
colorimetrically using the Folin method (See e.g., Bergmeyer et al., "Methods of Enzymatic
Analysis" vol. 5, Peptidases, Proteinases and their Inhibitors, Verlag Chemie, Weinheim
[1984]). Other assays involve the solubilization of chromogenic substrates (See e.g., Ward,
"Proteinases," in Fogarty (ed.), Microbial Enzymes and Biotechnology, Applied Science,
London, [1983], pp 251-317). Other exemplary assays include the succinyl-Ala-Ala-Pro-Phe-
para nitroanilide assay (SAAPFpNA) and the 2,4,6-trinitrobenzene sulfonate sodium salt
assay (TNBS assay). Numerous additional references known to those in the art provide
suitable methods (See e.g., Wells et al., Nucleic Acids Res. 11:7911-7925 [1983];
Christianson et al., Anal. Biochem., 223:119-129 [1994]; and Hsia et al., Anal. Biochem.,
242:221-227 [1999]).

Also as indicated above, means for determining the levels of secretion of a protein of
interest in a host cell and detecting expressed proteins include the use of immunoassays with
either polyclonal or monoclonal antibodies specific for the protein. Examples include
enzyme-linked immunosorbent assay (ELISA), radioimmunoassay (RIA), fluorescence
immunoassay (FIA), and fluorescent activated cell sorting (FACS). However, other methods
are known to those in the art and find use in assessing the protein of interest (See e.g.,
Hampton et al., Serological Methods, A Laboratory Manual, APS Press, St. Paul, MN [1990];
and Maddox et al., J. Exp. Med., 158:1211 [1983]). In some preferred embodiments,
secretion of a protein of interest is higher in the altered strain obtained using the present
invention than in a corresponding unaltered host. As known in the art, the altered Bacillus
cells produced using the present invention are maintained and grown under conditions
suitable for the expression and recovery of a polypeptide of interest from cell culture (See
e.g., Harwood and Cutting (eds.) Molecular Biological Methods for Bacillus, John Wiley &
Sons [1990]).

The manner and method of carrying out the present invention may be more fully
understood by those of skill in the art by reference to the following examples, which examples
are not intended in any manner to limit the scope of the present invention or of the claims
directed thereto.

EXPERIMENTAL 

The following Examples are provided in order to demonstrate and further illustrate 
certain preferred embodiments and aspects of the present invention and are not to be 
construed as limiting the scope thereof. 

In the experimental disclosure which follows, the following abbreviations apply: °C
(degrees Centigrade); rpm (revolutions per minute); H2O (water); dH2O (deionized water);
HCl (hydrochloric acid); aa (amino acid); bp (base pair); kb (kilobase pair);
kD (kilodaltons); gm (grams); µg (micrograms); mg (milligrams); ng (nanograms);
µl (microliters); ml (milliliters); mm (millimeters); nm (nanometers); µm (micrometer); M
(molar); mM (millimolar); µM (micromolar); U (units); V (volts); MW (molecular weight);
sec (seconds); min(s) (minute/minutes); hr(s) (hour/hours); MgCl2 (magnesium chloride);
NaCl (sodium chloride); OD280 (optical density at 280 nm); OD600 (optical density at 600
nm); PAGE (polyacrylamide gel electrophoresis); PBS (phosphate buffered saline [150
mM NaCl, 10 mM sodium phosphate buffer, pH 7.2]); PEG (polyethylene glycol); PCR
(polymerase chain reaction); RT-PCR (reverse transcription PCR); SDS (sodium dodecyl
sulfate); Tris (tris(hydroxymethyl)aminomethane); w/v (weight to volume); v/v (volume to
volume); LA medium (per liter: Difco Tryptone Peptone 20g, Difco Yeast Extract 10g, EM
Science NaCl 1g, EM Science Agar 17.5g, dH2O to 1L); ATCC (American Type Culture
Collection, Rockville, MD); Clontech (CLONTECH Laboratories, Palo Alto, CA); Difco
(Difco Laboratories, Detroit, MI); GIBCO BRL or Gibco BRL (Life Technologies, Inc.,
Gaithersburg, MD); Invitrogen (Invitrogen Corp., San Diego, CA); NEB (New England
Biolabs, Beverly, MA); Sigma (Sigma Chemical Co., St. Louis, MO); Takara (Takara Bio
Inc., Otsu, Japan); Roche Diagnostics and Roche (Roche Diagnostics, a division of F.
Hoffmann La Roche, Ltd., Basel, Switzerland); EM Science (EM Science, Gibbstown, NJ);
Qiagen (Qiagen, Inc., Valencia, CA); Stratagene (Stratagene Cloning Systems, La Jolla,
CA); Affymetrix (Affymetrix, Santa Clara, California).

EXAMPLE 1 

Creation of Deletion Strains 

This Example describes "Method 1," which is also depicted in Figure 1. In this
method, E. coli was used to produce a pJM102 plasmid vector carrying the DNA construct
to be transformed into Bacillus strains (See, Perego, supra). Regions immediately
flanking the 5' and 3' ends of the deletion site were PCR amplified. PCR primers were
designed to be approximately 37 base pairs in length, including 31 base pairs homologous
to the Bacillus subtilis chromosome and a 6 base pair restriction enzyme site located 6
base pairs from the 5' end of the primer. Primers were designed to engineer unique
restriction sites at the upstream and downstream ends of the construct and a BamHI site
between the two fragments for use in cloning. Primers for the antimicrobial markers
contained BamHI sites at both ends of the fragment. Where possible, PCR primers were
designed to remove promoters of deleted indigenous chromosomal regions, but to leave
all terminators in the immediate area. The primary source of chromosome sequence,
gene localization, and promoter and terminator information was Kunst et al.
(1997), supra; this information is also obtainable from the SubtiList World Wide Web Server known to
those in the art (See e.g., Moszer et al., supra). Numerous deletions have been made
using the present invention. A list of primer sequences from deletions created by this
method is provided in Table 1. Reference is also made to Figure 2 for an explanation of
the primer naming system.

Table 1. Primers



Primer name    Restriction enzyme    Primer sequence    SEQ ID NO
               engineered into primer

PBSX-UF        XbaI       CTACATTCTAGACGATTTGTTTGATCGATATGTGGAAGC  60
PBSX-UR        BamHI      GGCTGAGGATCCATTCCTCAGCCCAGAAGAGAACCTA  61
PBSX-DF        BamHI      TCCCTCGGATCCGAAATAGGTTCTGCTTATTGTATTCG  62
PBSX-DR        SacI       AGCGTTGAGCTCGCGCCATGCCATTATATTGGCTGCTG  63
Pphage 1-      EcoRI      GTGACGGAATTCCACGTGCGTCTTATATTGCTGAGCTT  64
Pphage 1-      BamHI      CGTTTTGGATCCAAAAACACCCCTTTAGATAATCTTAT  65
Pphage 1-      BamHI      ATCAAAGGATCCGCTATGCTCCAAATGTACACCTTTCCGT  66
Pphage 1-      PstI       ATATTTCTGCAGGCTGATATAAATAATACTGTGTGTTCC  67
Pphage 2-      SacI       CATCTTGAATTCAAAGGGTACAAGCACAGAGACAGAG  68
Pphage 2-      BamHI      TGACTTGGATCCGGTAAGTGGGCAGTTTGTGGGCAGT  69
Pphage 2-      BamHI      TAGATAGGATCCTATTGAAAACTGTTTAAGAAGAGGA  70
Pphage 2-      PstI       CTGATTCTGCAGGAGTGTTTTTGAAGGAAGCTTCATT  71
Pphage 4-      KpnI       CTCCGCGGTACCGTCACGAATGCGCCTCTTATTCTAT  72
Pphage 4-      BamHI      TCGCTGGGATCCTTGGCGCCGTGGAATCGATTTTGTCC  73
Pphage 4-      BamHI      GCAATGGGATCGTATATGAACGGTTATGAATTCACAA  74
Pphage 4-      PstI       CCAGAACTGCAGGAGCGAGGGGTCTCGCTGCCTGAAA  75
PPS-UF         SacI       GACAAGGAGCTCATGAAAAAAAGCATAAAGCTTTATGTTGC  76
PPS-UR         BamHI      GACAAGGGATCCCGGCATGTCCGTTATTACTTAATTTC  77
PPS-DF         BamHI      GACAAGGGATCCTGCCGqTTACCGGAAACGGA  78
PPS-DR         XbaI       GACAAGTCTAGATTATCGTTTGTGCAGTATTACTTG  79
SPβ-UF         SacI       ACTGATGAGCTCTGGCTAAACAGCAAACAGCAGAAC  80
SPβ-UR         BamHI      ACGAATGGATCGATCATAAAGCCGCAGCAGATTAAATAT  81
SPβ-DF         BamHI      ACTGATGGATCCATCTTCGATAAATATGAAAGTGGC  82
SPβ-DR         XbaI       ACTGATTCTAGAGGCTTTTTCTCTTGATGCAATTCTTC  83
PKS-UF         XbaI       GAGCCTCTAGAGCCCATTGAATCATTTGTTT  84
PKS-UR         BamHI      GAGCCGGATCCTTAAGGATGTCGTTTTTGTGTCT  85
PKS-DF         BamHI      GAGCCGGATCCATTTCGGGGTTCTCAAAAAAA  86
PKS-DR         SacI       GAGCCGAGCTCATGCAAATGGAAAAATTGAT  87
Skin-UF        XbaI       GAAGTTCTAGAGATTGTAATTACAAAAGGGGGGTG  88
Skin-UR        BamHI      GAAGTGGATCCTTTCACCGATCATAAAAGCCC  89
Skin-DF        BamHI      TGAAAGGATCCATTTTTCATTGATTGTTAAGTC  90
Skin-DR        SacI       GAAGTTAGAGGTCGGGGGGGCATAAATTTCCCG  91
Phleo-UF       BamHI      GCTTATGGATCCGATACAAGAGAGGTCTCTCG  92
Phleo-DR       BamHI      GCTTATGGATCCCTGTCATGGCGCATTAACG  93
Spec-UF        BamHI      ACTGATGGATCCATCGATTTTCGTTCGTGAATACATG  94
Spec-DR        BamHI      ACTGATGGATCCCATATGCAAGGGTTTATTGTTTTC  95
CssS-UF        XbaI       GCACGTTCTAGACCACCGTCCCCTGTGTTGTATCCAC  96
CssS-UR        BamHI      AGGAAGGGATCCAGAGCGAGGAAGATGTAGGATGATC  97
CssS-DF        BamHI      TGACAAGGATCCTGTATCATACCGCATAGCAGTGCC  98
CssS-DR        SacI       TTCCGCGAGCTCGGCGAGAGCTTCAGACTCCeTCACjA  99
SBO-           XbaI       GAGCCTCTAGATCAGCGATTTGACGGGGCec  100
SBO-           BamHI      TTATCTGGATCGCTGATGAGCAATGATeGTAAC^A I AU>A  101
SBO-           BamHI      GGGTAAGGATCCGCCAAAAGGGCA FACi 1 UA 1 I AU i  102
SBO-           Asp718     GAGATCGGTACCCTTTTGGGCCATATCGTGGA M I u  103
PhrC-UF        HindIII    GAGCCAAGCTTCATTGACAGCAACCAGGCAGATCTC  104
PhrC-DF        PstI       GCTTATAAGCTTGATACAAGAGAGGTCTCTCG  105
PhrC-UR        PstI       GCTTATAAGCTTCTGTCATGGCGCATTAACG  106
PhrC-DR        SacI       GAGCCGAGCTCCATGCCGATGAAGTCATCGTCGAGC  107
PhrC-UF-       HindIII    CGTGAAAAGCTTTCGCGGGATGTATGAAl 1 liiAFAAG  108
PhrC-DR-       SacI       TGTAGGGAGCTCGATGCGCCACAATGTCGGTACAACG  109

The restriction sites are designated as follows: XbaI is TCTAGA; BamHI is GGATCC; SacI is
GAGCTC; Asp718 is GGTACC; PstI is CTGCAG and HindIII is AAGCTT. Also prophage is
designated as "Pphage."
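
The primer layout described for Method 1 (a short 5' stretch, a 6 bp restriction site, then chromosome-homologous sequence) can be checked programmatically. The sketch below is illustrative only: it uses the recognition sequences listed in the note above and the four PBSX primers from Table 1, while the function names and the term "clamp" for the bases preceding the site are not from the disclosure.

```python
# Recognition sequences as listed in the note under Table 1.
SITES = {
    "XbaI": "TCTAGA",
    "BamHI": "GGATCC",
    "SacI": "GAGCTC",
    "Asp718": "GGTACC",
    "PstI": "CTGCAG",
    "HindIII": "AAGCTT",
}

# The four PBSX primers from Table 1: (enzyme, 5'->3' sequence).
PBSX_PRIMERS = {
    "PBSX-UF": ("XbaI", "CTACATTCTAGACGATTTGTTTGATCGATATGTGGAAGC"),
    "PBSX-UR": ("BamHI", "GGCTGAGGATCCATTCCTCAGCCCAGAAGAGAACCTA"),
    "PBSX-DF": ("BamHI", "TCCCTCGGATCCGAAATAGGTTCTGCTTATTGTATTCG"),
    "PBSX-DR": ("SacI", "AGCGTTGAGCTCGCGCCATGCCATTATATTGGCTGCTG"),
}


def site_offset(sequence: str, enzyme: str) -> int:
    """Return the 0-based offset of the enzyme's site, or -1 if it is absent."""
    return sequence.upper().find(SITES[enzyme])


def split_primer(sequence: str, enzyme: str):
    """Split a primer into (5' clamp, restriction site, remaining 3' part)."""
    start = site_offset(sequence, enzyme)
    if start < 0:
        raise ValueError(f"{enzyme} site not found in primer")
    end = start + len(SITES[enzyme])
    return sequence[:start], sequence[start:end], sequence[end:]


for name, (enzyme, seq) in PBSX_PRIMERS.items():
    clamp, site, homology = split_primer(seq, enzyme)
    print(name, enzyme, len(clamp), site, len(homology))
```

For these four primers the site begins 6 bases from the 5' end, consistent with the design rule stated for Method 1. A check of this kind could also flag table entries whose sequence does not contain the site named in the enzyme column.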


In this method, 100 µL PCR reactions were carried out in 150 µL Eppendorf tubes
containing 84 µL water, 10 µL PCR buffer, 1 µL of each primer (i.e., PKS-UF and
PKS-UR), 2 µL of dNTPs, 1 µL of wild type Bacillus chromosomal DNA template, and 1 µL of
polymerase. DNA polymerases used included Taq Plus Precision polymerase and
Herculase (Stratagene). Reactions were carried out in a Hybaid PCRExpress
thermocycler using the following program. The samples were first heated at 94°C for 5
minutes, then cooled to a 50°C hold. Polymerase was added at this point. Twenty-five
cycles of amplification consisted of 1 minute at 95°C, 1 minute at 50°C and 1 minute at
72°C. A final 10 minutes at 72°C ensured complete elongation. Samples were held at
4°C for analysis.
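
The reaction composition and cycling program above lend themselves to a quick arithmetic check. The sketch below is illustrative only: the component volumes and step times are taken from the paragraph above (with the cycle denaturation step read as 95°C), the step labels "denaturation", "annealing" and "extension" are conventional names not used in the text, and ramp times and the 50°C and 4°C holds are ignored.

```python
# Component volumes in microliters for one reaction, as listed above.
PCR_MIX_UL = {
    "water": 84,
    "PCR buffer": 10,
    "upstream primer": 1,
    "downstream primer": 1,
    "dNTPs": 2,
    "chromosomal DNA template": 1,
    "polymerase": 1,
}

# (label, temperature in degrees C, minutes)
INITIAL_STEP = ("initial denaturation", 94, 5)
CYCLE_STEPS = [
    ("denaturation", 95, 1),
    ("annealing", 50, 1),
    ("extension", 72, 1),
]
CYCLES = 25
FINAL_STEP = ("final extension", 72, 10)


def total_volume_ul(mix: dict) -> int:
    """Sum of all component volumes in the reaction."""
    return sum(mix.values())


def programmed_minutes() -> int:
    """Programmed incubation time, excluding holds and temperature ramps."""
    per_cycle = sum(minutes for _, _, minutes in CYCLE_STEPS)
    return INITIAL_STEP[2] + CYCLES * per_cycle + FINAL_STEP[2]


print(total_volume_ul(PCR_MIX_UL), "uL per reaction")
print(programmed_minutes(), "minutes of programmed incubation")
```

The components sum to the stated 100 µL reaction volume, which supports reading each garbled unit in the source as microliters.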

After completion of the PCR, 10 µL of each reaction were run on an Invitrogen
1.2% agarose E-gel at 60 volts for 30 minutes to check for the presence of a band at
the correct size. All the gel electrophoresis methods described herein used these
conditions. If a band was present, the remainder of the reaction tube was purified using
the Qiagen Qiaquick® PCR purification kit according to the manufacturer's instructions,
then cut with the appropriate restriction enzyme pair. Digests were performed at 37°C
for 1 hour as a 20 µL reaction consisting of 9 µL of water, 2 µL of 10xBSA, 2 µL of an
appropriate NEB restriction buffer (according to the 2000-01 NEB Catalog and
Technical Reference), 5 µL of template, and 1 µL of each restriction enzyme. For
example, the PBSX upstream fragment and CssS upstream fragments were cut with
XbaI and BamHI in NEB (New England BioLabs) restriction buffer B. The digested
fragments were purified by gel electrophoresis and extraction using the Qiagen
Qiaquick gel extraction kit following the manufacturer's instructions. Figures 5 and 6
provide gels showing the results for various deletions.

Ligation of the fragments into a plasmid vector was done in two steps, using either
the Takara ligation kit following the manufacturer's instructions or T4 DNA ligase (Reaction
contents: 5 µL each insert fragment, 1 µL cut pJM102 plasmid, 3 µL T4 DNA ligase buffer,
and 1 µL T4 DNA ligase). First, the cut upstream and downstream fragments were ligated
overnight at 15°C into unique restriction sites in the pJM102 plasmid polylinker, connecting
at the common BamHI site to re-form a circular plasmid. The pJM102 plasmid was cut with
the unique restriction enzyme sites appropriate for each deletion (See, Table 2; for cssS,
XbaI and SacI were used) and purified as described above prior to ligation. This
re-circularized plasmid was transformed into Invitrogen's "Top Ten" E. coli cells, using the
manufacturer's One Shot transformation protocol.

Transformants were selected on Luria-Bertani broth solidified with 1.5% agar (LA)
plus 50 ppm carbenicillin containing X-gal for blue-white screening. Clones were picked
and grown overnight at 37°C in 5 mL of Luria-Bertani broth (LB) plus 50 ppm carbenicillin
and plasmids were isolated using Qiagen's Qiaquick Mini-Prep kit. Restriction analysis
confirmed the presence of the insert by cutting with the restriction sites at each end of the
insert to drop an approximately 2 kb band out of the plasmid. Confirmed plasmids with the
insert were cut with BamHI to linearize them in digestion reactions as described above
(with an additional 1 µL of water in place of a second restriction enzyme), treated with 1 µL
calf intestinal and shrimp phosphatases for 1 hour at 37°C to prevent re-circularization,
and ligated to the antimicrobial resistance marker as listed in Table 2. Antimicrobial
markers were cut with BamHI and cleaned using the Qiagen Gel Extraction Kit following
manufacturer's instructions prior to ligation. This plasmid was cloned into E. coli as
before, using 5 ppm phleomycin (phl) or 100 ppm spectinomycin (spc) as appropriate for
selection. Confirmation of marker insertion in isolated plasmids was done as described
above by restriction analysis with BamHI. Prior to transformation into B. subtilis, the
plasmid was linearized with ScaI to ensure a double crossover event.



Table 2. Unique Restriction Enzyme Pairs Used in Deletion Constructs

Deletion Name    Unique Restriction Enzyme Pair    Antimicrobial Marker

Sbo              XbaI-Asp718                       spc
Slr              XbaI-SacI                         phleo
YbcO             XbaI-SacI                         spc
Csn              XbaI-SalI                         phleo
PBSX             XbaI-SacI                         phl
PKS              XbaI-SacI                         phl
SPβ              XbaI-SacI                         spec
PPS              XbaI-SacI                         spec
Skin             XbaI-SacI                         phl


EXAMPLE 2 

Creation of DNA Constructs Using PCR Fusion to Bypass E. coli



This Example describes "Method 2," which is also depicted in Figure 3. Upstream
and downstream fragments were amplified as in Method 1, except the primers were
designed with 25 bp "tails" complementary to the antimicrobial marker's primer sequences.
A "tail" is defined herein as base pairs on the 5' end of a primer that are not homologous to
the sequence being directly amplified, but are complementary to another sequence of
DNA. Similarly, the primers for amplifying the antimicrobial contain "tails" that are
complementary to the fragments' primers. For any given deletion, the DeletionX-UFfus
and DeletionX-URfus are direct complements of one another. This is also true for the
DF-fus and DR-fus primer sets. In addition, in some embodiments, these primers contain
restriction enzyme sites similar to those used in Method 1 for use in creating a plasmid
vector (See, Table 3 and U.S. Patent No. 5,023,171). Table 3 provides a list of primers
useful for creation of deletion constructs by PCR fusion. Table 4 provides an additional list
of primers useful for creation of deletion constructs by PCR fusion. However, in this
Table, all deletion constructs would include the phleoR marker.
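
The statement that the DF-fus and DR-fus primers are direct complements of one another can be verified by taking a reverse complement. The sketch below is illustrative only: it uses the Skin+pro7-DF-phleo and Skin+pro7-DR-phleo sequences from Table 3, written as three parts (25 nt, the BamHI site, 25 nt) purely for readability; the function and variable names are not from the disclosure.

```python
# Translation table mapping each base to its Watson-Crick partner.
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence written 5'->3'."""
    return seq.upper().translate(_COMPLEMENT)[::-1]


BAMHI_SITE = "GGATCC"

# Skin+pro7-DF-phleo (SEQ ID NO: 132) and Skin+pro7-DR-phleo (SEQ ID NO: 133),
# each split as 25 nt + BamHI site + 25 nt.
DF_FUS = "CGTTAATGCGCCATGACAGCCATGA" + BAMHI_SITE + "CATACGGGGTACACAATGTACCATA"
DR_FUS = "TATGGTACATTGTGTACCCCGTATG" + BAMHI_SITE + "TCATGGCTGTCATGGCGCATTAACG"


def are_direct_complements(primer_a: str, primer_b: str) -> bool:
    """True when one primer is exactly the reverse complement of the other."""
    return reverse_complement(primer_a) == primer_b.upper()


print(len(DF_FUS), len(DR_FUS))
print(are_direct_complements(DF_FUS, DR_FUS))
```

Because the two primers are exact reverse complements, a legible member of a fusion pair can in principle be used to cross-check its partner.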



Table 3. Primers

Primer name    Restriction enzyme    Sequence    SEQ ID NO
               engineered into primer

DHB-UF               XbaI     CGAGAATCTAGAACAGGATGAATCATCTGTGGCGGG  110
DHB-UFfus-phleo      BamHI    CGACTGTCCAGCCGCTCGGCACATCGGATCCGCTTACCGAAAGCCAGACTCAGCAA  111
DHB-URfus-phleo      BamHI    TTGCTGAGTCTGGCTTTCGGTAAGCGGATCCGATGTGCCGAGCGGCTGGACAGTCG  112
DHB-DFfus-phleo      BamHI    CGTTAATGCGCCATGACAGCCATGAGGATCCCACAACCCCGCACGCCTTGCCACAC  113
DHB-DRfus-phleo      BamHI    GTGTGGCAAGGCGTGCGGGGTTGTGGGATCCTCATGGCTGTCATGGCGCATTAACG  114
DHB-DR               SacI     GACTTGGTCGACGAGTGCGGACGGGCAGCATCACCA  115
DHB-UF-nested        XbaI     GGCATATCTAGAGAUA 1 isAAijl^olaoAAML/MVjM i va  116
DHB-DR-nested        SacI     GGTGCGGAGCTCGACAGTATCACAGCCAGCGCTG  117
YvfF-yveK-UF         XbaI     AAGCGTTCTAGACTGCGGATGCAGATCGATCTCGee  118
YvfF-yveK-UF-phleo   BamHI    AACCTTCCGCTCACATGTGAGCAGGGGATCCGCTTACCGAAAGCCAGACTCAGCAA  119
YvfF-yveK-UR-phleo   BamHI    TTGCTGAGTCTGGCTTTCGGTAAGCGGATCCCCTGCTCACATGTGAGCGGAAGGTT  120
YvfF-yveK-DF-phleo   BamHI    CGTTAATGCGCCATGACAGCCATGAGGATCCGCCTTCAGCCTTCCCGCGGCTGGCT  121
YvfF-yveK-DR-        BamHI    TCATGGCTGTCATGGCGCATTAACG  122
YvfF-yveK-DR         PstI     GAAGCACTGCAGCCCACACTTCAGGCGGCTCAGGTC  123
YvfF-yveK-UF-        XbaI     nan ATATPTA A ATGGTATG A AGCGGAATTCCCG  124
YvfF-yveK-DR-        KpnI     ATAAACGGTACCCCCCTATAGATGCGAACGTTAGCCC  125
Prophage7-UF         EcoRI    AAGGAGG AATTCCATCTi vsAiao 1 A 1 AUAAAUA« 1 i  126
Prophage 7-UF-       BamHI    TCTCCGAGAAAGACAGGCAGGATCGGGATCC  127
Prophage 7-UR-       BamHI    TTGCTGAGTCTGGCTTTCGGTAAGCGGATCC  128
Skin+prophage7-      Asp718   AAGGACGGTACCGGCTCATTACCCTCTTTTCAAGGGT  129
Skin+pro7-UF-phleo   BamHI    ACCAAAGCCGGACTCCCCCGCGAGAGGATCCGCTTACCGAAAGCCAGACTCAGCAA  130
Skin+pro7-UR-phleo   BamHI    TTGCTGAGTCTGGCTTTCGGTAAGCGGATCCTCTCGCGGGGGAGTCCGGCTTTGGT  131
Skin+pro7-DF-phleo   BamHI    CGTTAATGCGCCATGACAGCCATGAGGATCCCATACGGGGTACACAATGTACCATA  132
Skin+pro7-DR-phleo   BamHI    TATGGTACATTGTGTACCCCGTATGGGATCCTCATGGCTGTCATGGCGCATTAACG  133
Skin+pro7-DR         PstI     GTCAACCTGCAGAGCGGCCCAGGTACAAGTTGGGGA  134
Skin+pro7-UF-        SacI     GGATCAGAGCTCGCTTGTCCTCCTGGGAACAGCCGG  135
Skin+pro7-DR-        PstI     TATATGCTGCAGGGCTCAGACGGTACCGGTTGTTCCT  136

The restriction sites are designated as follows: XbaI is TCTAGA; BamHI is GGATCC; SacI is GAGCTC;
Asp718 is GGTACC; PstI is CTGCAG and HindIII is AAGCTT.

Table 4. Additional Primers Used to Create Deletion Constructs by PCR Fusion*.



Primer Name 


Restriction 
Enzyme 

cny 111001 w 

Into Primer 


Sequence 


SEQ 
ID 
NO* 


SIr-UF 


Xbal 


CTGAACTCTAGACCTTCACCAGGCACAGAGGAGGTGA 


137 


Slr-Uffus 


BamHI 


GCCAATAAGTTCTCT I I AijAo AAOAIjom i 
GCTTACCGAAAGCCAGACTCAGCAA 


138 


Slr-Urfus


BamHI 


TTGCTGAGTCTGGCTTTCGGTAAGCGGATCGTTGTTGTGT 
AAAGAGAACTTATTGec 


139 


Slr-Dffus 


BamHI 


CGTTAATGCGCCATGACAGCCATGAGGATCC 

^^rsK^A A^/^TTTT/^/^/^AT/^XAXAf^^r^^^ 
GGGCTAACoTTOCaOA I O l A l AoooVj 


140 


Slr-Drfus


BamHI 


CCGCTATAGATGCGAAGGTTAGCCC GGATCC 
TCATGGCTGTCATGGCGGATTAACG 


141 


Slr-DR


SacI


TGAGACGAGCTCGATGCATAGGCGACGGCAGGGCGCC 


142 


Slr-UF-nested


XbaI


CGAAATTCTAGATCCCGCGATTGCGCCCi I IGIGG 


143 


Slr-DR-nested 


SacI


xxrrAAGAGCTCGCGGAATACGGGAAGCAGCCCC 


144 


YbcO-UF 


XbaI


PAAXXCXCXAGAGCGGTCGGCGCAGGTATAGGAGGGG 


145 


YbcO-UF 


BamHI


riAAAAGAAACCAAAAAGAATGGGAAGGATCC 
GCTTACCGAAAGCCAGACTCAGCAA 


146 


YbcO-UR 


BamHI


TTGCTGAGTCTGGCTTTCGGTAAGCGGATCC 
TTCCCATTCTTTTTGGTTTCTTTTC 


147 


YbcO-DF


BamHI


CGTTAATGCGCCATGACAGCCATGAGGATCC 
GCTATTTAACATTTGAGAATAGGGA 


148 


YbcO-DR


BamHI


TCCCTATTCTCAAATGTTAAATAGCGGATCC 
TCATGGCTGTCATGGCGGATTAACG 


149 


YDCU-UK 




CAGGCGGAGCTCCCAI I lATGACGTGCTTCCCTAAGC 


150 


Csn-UF


XbaI


TACGAATCTAGAGATCATTGCGGAAGTAGAAGTGGAA


151 


Csn-UF


BamHI


TTTAGATTGAGTTCATCTGCAGCGGGGATCC 
GCTTACCGAAAGCCAGACTCAGCAA 


152 


Csn-UR 


BamHI 


TTGCTGAGTCTGGCTTTCGGTAAGCGGATCC
CCGCTGCAGATGAACTCAATCTAAA 


153 


Csn-DF 


BamHI 


CGTTAATGCGCCATGACAGCCATGAGGATCC
GCCAATCAGCCTTAGCCCCTCTCAC 


154 




BamHI 


GTGAGAGGGGCTAAGGCTGATTGGCGGATCC 
TCATGGCTGTCATGGCGGATTAACG 


155 


Csn-DR 


SalI


ATACTCGTCGACATACGTTGAATTGCCGAGAAGCCGC 


156 


Csn-UF- 


NA 


CTGGAGTACCTGGATCTGGATCTCC 


157 


Csn-DR- 


NA 


GGTCGGCTTGTTTCAGCTCATTTCC 


158 


SigB-UF


SacI


CGGTTTGAGCTCGCGTCCTGATCTGCAGAAGCTCATT 


159 


SigB-UF 


BamHI 


CTAAAGATGAAGTCGATCGGCTCATGGATCC 
GCTTACCGAAAGCCAGACTCAGCAA 


160 


SigB-UR 


BamHI 


TTGCTGAGTCTGGCTTTCGGTAAGCGGATCC 
ATGAGCCGATCGACTTCATCTTTAG 


161


SigB-DF


BamHI


CGTTAATGCGCCATGACAGCCATGAGGATCC
GAAGATCCCTCGATGGAGTTAATGT 


162 


SigB-DR 


BamHI 


ACATTAACTCCATCGAGGGATCTTCGGATCC 
TCATGGCTGTCATGGCGCATTAACG 


163


SigB-DR


SalI


GCTTCGGTCGACTTTGCCGTCTGGATATGCGTCTCTCG 


164 


SigB-UF- 


SacI


GTCAAAGAGCTCTATGACAGCCTCCTCAAATTGCAGG 


165 


oiy D 1-^1 \ 


SalI


TTCCATGTCGACGCTGTGCAAAACCGCCGGCAGCGCC 


166 


SpoIISA-UF


EcoRI


ACATTCGAATTCAGCAGGTCAATCAGCTCGCTGACGC


167 


SpoIISA-UF


BamHI


CCAGCACTGCGCTCCCTCACCCGAAGGATCC 
GCTTACCGAAAGCCAGACTCAGCAA 


168 


SpoIISA-UR


BamHI


TTGCTGAGTCTGGCTTTCGGTAAGCGGATCC
TTCGGGTGAGGGAGCGCAGTGCTGG 


169 


SpoIISA-DF


BamHI


CGTTAATGCGCCATGACAGCCATGAGGATCC
TCGAGAGATCCGGATGGTTTTCCTG 


170 


SpoIISA-DR


BamHI


CAGGAAAACCATCCGGATCTCTCGAGGATCC
TCATGGCTGTCATGGCGCATTAACG 


171 


SpoIISA-DR


HindIII


AGTCATAAGCTTTCTGGCGTTTGATTTCATCAACGGG


172 


SpoIISA-UF-


NA 


CAGCGCGACTTGTTAAGGGACAATA 


173 


SpoIISA-DR-


NA 


GGCTGCTGTGATGAACTTTGTCGGA 


174 



*All deletion constructs include the phleoR marker
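
The fusion primer pairs in Table 4 appear to follow a common design: 25 gene-specific bases, the BamHI site, and 25 bases matching the phleomycin marker, with the forward and reverse fusion primers being reverse complements of each other. The following Python sketch checks this for the SigB pair listed as SEQ ID NOS:160 and 161. The helper names are illustrative only and are not part of the original disclosure.

```python
# Sketch: check that a PCR-fusion primer pair from Table 4 is a
# reverse-complement pair sharing a central BamHI site (GGATCC).
# Helper names are hypothetical; sequences are copied from Table 4.

COMPLEMENT = str.maketrans("ACGT", "TGCA")
BAMHI = "GGATCC"


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of an uppercase DNA string."""
    return seq.translate(COMPLEMENT)[::-1]


# Each primer is printed on two table lines; they are joined here.
SIGB_UF_FUSION = "CTAAAGATGAAGTCGATCGGCTCATGGATCC" + "GCTTACCGAAAGCCAGACTCAGCAA"
SIGB_UR_FUSION = "TTGCTGAGTCTGGCTTTCGGTAAGCGGATCC" + "ATGAGCCGATCGACTTCATCTTTAG"


def is_fusion_pair(forward: str, reverse: str) -> bool:
    """True if the two primers anneal to each other over their full length."""
    return reverse_complement(forward) == reverse


def site_offset(primer: str, site: str = BAMHI) -> int:
    """0-based offset of the first occurrence of the site, or -1 if absent."""
    return primer.find(site)


if __name__ == "__main__":
    print("reverse-complement pair:", is_fusion_pair(SIGB_UF_FUSION, SIGB_UR_FUSION))
    print("BamHI offset (forward):", site_offset(SIGB_UF_FUSION))
    print("BamHI offset (reverse):", site_offset(SIGB_UR_FUSION))
```

Because the two primers are fully complementary, the upstream fragment and the marker fragment share a 56-base overlap, which is what allows them to prime on each other during the fusion reaction.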



The fragments listed in Tables 3 and 4 were size-verified by gel electrophoresis as
described above. If correct, 1 µL each of the upstream, downstream, and antimicrobial
resistance marker fragments were placed in a single reaction tube with the DeletionX-UF
and DeletionX-DR primers or nested primers where listed. Nested primers are 25 base
pairs of DNA homologous to an internal portion of the upstream or downstream fragment,
usually about 100 base pairs from the outside end of the fragment (See, Figure 2). The
use of nested primers frequently enhances the success of fusion. The PCR reaction
components were similar to those described above, except 82 µL of water was used to
compensate for additional template volume. The PCR reaction conditions were similar to
those described above, except the 72°C extension was lengthened to 3 minutes. During
extension, the antimicrobial resistance gene was fused in between the upstream and
downstream pieces. This fusion fragment can be directly transformed into Bacillus without
any purification steps or with a simple Qiagen Qiaquick PCR purification done according to
manufacturer's instructions.

EXAMPLE 3

Creation of DNA Constructs Using Ligation of PCR Fragments and Direct
Transformation of Bacillus subtilis to Bypass the E. coli Cloning Step

In this Example, a method ("Method 3") for creating DNA constructs using ligation
of PCR fragments and direct transformation of Bacillus is described. By way of example,
modifications of prpC, sigD and tdh/kbl are provided to demonstrate the method of ligation.
Indeed, sigD and tdh/kbl were constructed by one method and prpC by an alternate
method.

A. Tdh/Kbl and SigD 

The upstream and downstream fragments adjacent to the tdh/kbl region of the
Bacillus subtilis chromosome were amplified by PCR as described in Method 1,
except that the inside primer of the flanking DNA was designed to contain type IIS
restriction sites. Primers for the loxP-spectinomycin-loxP cassette were designed with the
same type IIS restriction site as the flanks and complementary overhangs. Unique
overhangs for the left flank and the right flank allowed directional ligation of the
antimicrobial cassette between the upstream and downstream flanking DNA. All DNA
fragments were digested with the appropriate restriction enzymes, and the fragments were
purified with a Qiagen Qiaquick PCR purification kit using the manufacturer's instructions.
This purification was followed by desalting in a 1 mL spin column containing BioRad P-6
gel and equilibrated with 2 mM Tris-HCl, pH 7.5. Fragments were concentrated to 124 to
250 ng/µL using a Savant Speed Vac SC110 system. Three piece ligations of 0.8 to 1 µg
of each fragment were performed with 12 U T4 ligase (Roche) in a 15 to 25 µL reaction
volume at 14 to 16°C for 16 hours. The total yield of the desired ligation product was >100
ng per reaction, as estimated by comparison to a standard DNA ladder on an agarose
gel. The ligation mixture was used without purification for transformation reactions.
Primers for this construction are shown in Table 5, below.

Table 5. Primers for tdh/kbl Deletion



Primer 
Name 


Restriction 
Enzyme 

Engineered 
Into 
Primer 


Primer Sequence 


SEQ 

ID 

NO: 


d70 DR 


none 


CTCAGTTCATCCATCAAATCACCAAGTCCG 


175 


PB2 DF 


BbsI


TACACGTTAGAAGACGGCTAGATGCGTCTGATTGTGACAGAC 
GGCG 


176 


d71 UF 


none 


AACCTTCCAGTCCGGTTTACTGTCGC 


177 


P83 UR 


BbsI


GTACCATAAGAAGACGGAGCTTGCCGTGTCCACTCCGATTAT 
AGCAG 


178 


d98sdc F 


BbsI


CCTTGTCTTGAAGACGGAGCTGGATCCATAACTTCGTATAATG 


179 


D106SDCR 


BbsI


GTACCATAAGAAGACGGCTAGAGGATGCATATGGCGGCCGC 


180 


D112UF* 


none 


CATATGCTCCGGCTCTTCAAGCAAG (analytical primer) 


181 


D113DR* 


none 


CCTGAGATTGATAAACATGAAGTCCTC (analytical primer)


182 



*primers for analytical PCR
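
The directional ligation described for the tdh/kbl construct can be illustrated from the BbsI primers in Table 5. BbsI is a type IIS enzyme that recognizes GAAGAC and cuts 2 nucleotides downstream on the same strand and 6 nucleotides downstream on the opposite strand, leaving a 4-base 5' overhang. The following Python sketch, with hypothetical helper names, reads the expected overhang from each primer and tests which ends are compatible. It assumes the primer sequences as printed and does not model the digest itself.

```python
# Sketch: predict the 4-nt 5' overhang that BbsI, GAAGAC(2/6), leaves on an
# end built from each Table 5 primer, then test end compatibility.
# Function names are hypothetical; sequences are copied from Table 5.

BBSI_SITE = "GAAGAC"
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    return seq.translate(COMPLEMENT)[::-1]


def bbsi_overhang(primer: str) -> str:
    """Overhang read 5'->3' on the primer strand: bases 3-6 after the site."""
    start = primer.find(BBSI_SITE)
    if start < 0:
        raise ValueError("no BbsI site in primer")
    cut = start + len(BBSI_SITE) + 2  # top-strand cut, 2 nt past the site
    return primer[cut:cut + 4]


def can_ligate(overhang_a: str, overhang_b: str) -> bool:
    """Two 5' overhangs anneal when one is the reverse complement of the other."""
    return overhang_a == reverse_complement(overhang_b)


PRIMERS = {
    "PB2 DF": "TACACGTTAGAAGACGGCTAGATGCGTCTGATTGTGACAGACGGCG",
    "P83 UR": "GTACCATAAGAAGACGGAGCTTGCCGTGTCCACTCCGATTATAGCAG",
    "d98sdc F": "CCTTGTCTTGAAGACGGAGCTGGATCCATAACTTCGTATAATG",
    "D106SDCR": "GTACCATAAGAAGACGGCTAGAGGATGCATATGGCGGCCGC",
}

OVERHANGS = {name: bbsi_overhang(seq) for name, seq in PRIMERS.items()}

if __name__ == "__main__":
    for name, overhang in OVERHANGS.items():
        print(f"{name}: {overhang}")
    # Upstream flank joins the cassette forward end; cassette reverse end
    # joins the downstream flank.
    print(can_ligate(OVERHANGS["P83 UR"], OVERHANGS["d98sdc F"]))
    print(can_ligate(OVERHANGS["D106SDCR"], OVERHANGS["PB2 DF"]))
    print(can_ligate(OVERHANGS["P83 UR"], OVERHANGS["D106SDCR"]))
```

Under these assumptions the upstream junction uses one overhang and the downstream junction a different one, so the cassette cannot join the wrong flank.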



The construct for the sigD deletion closely followed construction of tdh/kbl. The
primers used for the sigD construction are provided in Table 6.



Table 6. Primers for sigD Construction 



Primer 
Name 


Restriction 

Enzyme 
Engineered 
Into 
Primer 


Primer Sequence 


SEQ 
ID 
NO: 


SigD UF 


none 


ATATTGAAGTCGGCTGGATTGTGG


183 


SigD UR 


BglII


GCGGCAGATCTCGGCGCATTAAGTCGTCA 


184 


SigD DF 


EcoRI 


GCGGCGAATTCTCTGCTGGAAAAAGTGATACA 


185 


SigD DR 


none 


TTCGCTGGGATAACAACAT 


186 


Loxspc UF 


BglII


GCGGCAGATCTTAAGCTGGATCCATAACTTCG 


187 


Loxspc DR 


EcoRI 


GCGGCGAATTCATATGGCGGCCGCATAACTTC 


188 


SigD UO 


none 


CAATTTACGCGGGGTGGTG 


189 


SigD DO 


none 


GAATAGGTTACGCAGTTGTTG 


190 


Spc UR 


none 


CTCCTGATCCAAACATGTAAG 


191 


Spc DF 


none 


AACCCTTGCATATGTCTAG 


192

B. PrpC

An additional example of creating a DNA molecule by ligation of PCR amplified
DNA fragments for direct transformation of Bacillus involved a partial in-frame deletion of
the gene prpC. A 3953 bp fragment of Bacillus subtilis chromosomal DNA containing the
prpC gene was amplified by PCR using primers p95 and p96. The fragment was cleaved
at unique restriction sites PflMI and BstXI. This yielded three fragments: an upstream, a
downstream, and a central fragment. The latter is the fragment deleted and consists of
170 bp located internal to the prpC gene. The digestion mixture was purified with a
Qiagen Qiaquick PCR purification kit followed by desalting in a 1 mL spin column
containing BioRad P-6 gel and equilibrated with 2 mM Tris-HCl, pH 7.5. In a second PCR
reaction, the antimicrobial cassette, loxP-spectinomycin-loxP, was amplified with the
primer containing a BstXI site and the downstream primer containing a PflMI site, both with
cleavage sites complementary to the sites in the genomic DNA fragment. The fragment
was digested with PflMI and BstXI and purified as described for the chromosomal
fragment above. A three piece ligation of the upstream, antimicrobial cassette, and the
downstream fragments was carried out as for tdh/kbl, described above. The yield of
desired ligation product was similar and the ligation product was used without further
treatment for the transformation of xylRcomK competent Bacillus subtilis, as described in
greater detail below.



Table 7. Primers for prpC Deletion 



Primer 
Name 


Restriction 
Enzyme 

Engineered 
Into 
Primer 


Primer Sequence 


SEQ 
ID 
NO: 


p95 
DF 


none 


GCGCCCTTGATCCTAAGTCAGATGAAAC 


193 


p96 
UR 


none 


CGGGTCGGATACTGACTGTAAGTTTGAC 


194 


p100 

SDCR 


PflMI 


GTACCATAACCATGCCTTGGTTAGGATGCATATGGCGGCCGC 


195 


p101 

SDCF 


BstXI 


CCTTGTCTTCCATCTTGCTGGAGGTGGATCCATAACTTCGTATAATG 


196 


p114 
anal. 


none 


GAGAGCAAGGACATGACATTGACGC 


197 


p115 
anal.* 


none 


GATCTTCACCCTCTTCAACTTGTAAAG 


198 



*anal., analytical PCR primer

C. PckA Deletion

In addition to the above deletions, pckA was also modified. The PCR primers pckA
UF, pckA-2URfus, spc ffus, spc rfus, pckA DFfus and pckA DR were used for PCR and
PCR fusion reactions using the chromosomal DNA of a Bacillus subtilis I168 derivative
and pDG1726 (See, Guerout-Fleury et al., Gene 167(1-2):335-6 [1995]) as template. The
primers are shown in Table 8. The method used in constructing these deletion mutants
was the same as Method 1, described above.



Table 8. Primers Used for PckA Deletion 



Primer 
Name 


Restriction 
Enzyme 

Engineered 
Into 
Primer 


Primer Sequence 


Seq 
ID 
NO: 


pckA UF


none 


TTTGCTTCCTCCTGCACAAGGCCTC 


199 


pckA-2URfus


none


CGTTATTGTGTGTGGATTTCCATTGT


200 


spc ffus


none 


CAATGGAAATGCACACACAATAACGTGACTGGCAA 
GAGA 


201 


pckA DFfus 


none 


GTAATGGCCCTCTCGTATAAAAAAC 


202 


spc rfus


none 


GTTTTTTATACGAGAGGGCCATTACCAATTAGAAT 
GAATATTTCCC 


203 


pckA DR


none


GAGCAAAATGTTTCGATTCAGCATTGCT


204 



D. Xylose-Induced Competence Host Cell Transformation with 
Ligated DNA. 

Cells of a host strain Bacillus subtilis with partial genotype xylRcomK were rendered
competent by growth for 2 hours in Luria-Bertani medium containing 1% xylose, as
described in U.S. Patent Appln. Ser. No. 09/927,161, filed August 10, 2001, herein
incorporated by reference, to an OD550 of 1. This culture was seeded from a 6 hour culture.
All cultures were grown at 37°C, with shaking at 300 rpm. Aliquots of 0.3 mL were frozen
as 1:1 mixtures of culture and 30% glycerol in round bottom 2 mL tubes and stored in liquid
nitrogen for future use.

For transformation, frozen competent cells were thawed at 37°C and immediately
after thawing was completed, DNA from ligation reaction mixtures was added at a level of
5 to 15 µL per tube. Tubes were then shaken at 1400 rpm (Tekmar VXR S-10) for 60 min
at 37°C. The transformation mixture was plated without dilution in 100 µL aliquots on 8
cm LA plates containing 100 ppm of spectinomycin. After growth overnight,
transformants were picked into Luria-Bertani (100 ppm spectinomycin) and grown at 37°C
for genomic DNA isolation performed as known in the art (See e.g., Harwood and
Cutting, Molecular Biological Methods for Bacillus, John Wiley and Son, New York, N.Y.
[1990], at p. 23). Typically 400 to 1400 transformants were obtained from 100 µL
transformation mix, when 5 µL of ligation reaction mix was used in the transformation.

When the antimicrobial marker was located between two loxP sites in the incoming
DNA, the marker could be removed by transforming the strain with a plasmid containing
the cre gene capable of expressing the Cre protein. Cells were transformed with
pCRM-TS-pleo (See below), cultured at 37°C to 42°C, plated onto LA and, after colonies formed,
patched onto LA containing 100 ppm spectinomycin. Patches which did not grow after
overnight incubation were deemed to have lost the antimicrobial marker. Loss of marker
was verified by PCR assay with primers appropriate for the given gene.

pCRM-TS-pleo has the following sequence (SEQ ID NO:205):

GGGGATCTCTGCAGTGAGATCTGGTAATGACTCTCTAGCTTGAGGCATCAAATAAAACGAM^^ 

GCTCAGTCGAAAGACTGGGCCmCGTmATCTGTTGmGTCGGTGAACGCTCTCCTGA^^ 

GGACAAATCCGCCGGTCTAGCTAAGCAGAAGGCCATCCTGACGGATGGCCTTTTTGCGTTTCT 

ACAAACTCTTGTTAACTCTAGAGCTGCCTGCCG CGmCG GTGATGAAGATCTTGCCGA-r^ATT 

AATTAATTCAGAACGCTCGGTTGGCGCCGGGCGTTTTTTATGCAGCAATGGCAAGAACGTTGC 

TCTAGAATAATTCTACACAGCCCAGTCCAGACTATTCGGCACTGAAATTATGGGTGAAGTGGTC 

AAGACCTCACTAGGCACCTTAAAAATAGCGCACCCTGAAGAAGATTTAmGAGCTAGCCCTT 

GCCTACCTAGCTTCCAAGAAAGATATCCTAACAGCACAAGAGCGGAAAGATGTTTTGTTCTACA 

TCCAGAAGAACCTCTGCTAAAATTCCTGAAAAATTnGCAAAAAGTTGTTGACTTTATCTACAAG 

GTGTGGCATAATGTGTGGAATTGTGAGCGGATAACAATTAAGCTTAGGAGGGAGTGTTAAATG 

TCCAATTTACTGACCGTACACCAAAATTTGCCTGCATTACCGGTCGATGCAACGAGTGATGAG 

GTTGQCAAGAACCTGATGGACATGTrCAGGGATCGGCAGGCGTnTCTGAGCATACCTGGAAA 

ATGCTTCTGTCCGTTTGCCGGTCGTGGGCGGCATGGTGCAAGTTGAATAACCGGAAATGGTTT 

CCCGCAGAACCTGAAGATGTTCGCGATTATCTTCTATATCTTCAGGCGCGCGGTCTGGCAGTA 

AAAACTATCCAGCAACATTTGGGCCAGCTAAACATGCTTCATCGTCGGTCCGGGCTGCCACGA 

CCAAGTGACAGCAATGCTGTTTGACTGGTTATGCGGCGGATCCGAAAAGAAAACGTTGATGCC 

GGTGAACGTGCAAAACAGGCTCTAGCGTrGGAACGCACTGATTTCGACCAGGTTCGTTCACTG 

ATGGAAAATAGCGATCGCTGCCAGGATATACGTAATCTGGCATTTCTGGGGATTGCTTATAACA 

CCCTGTTACGTATAGCCGAAATTGCCAGGATCAGGGTrAAAGATATCTCACGTACTGACGGTG 

GGAGAATGTTAATCCATATTGGCAGAACGAAAACGCTGGTTAGCACCGCAGGTGTAGAGAAG 

GCACTTAGCCTGGGGGTAACTAAACTGGTCGAGCGATGGATTTCCGTCTCTGGTGTAGCTGAT 

GATCCGAATAACTACCTGTnTGCCGGGTCAGAAAAAATGGTGTTGCCGCGCCATCTGCGACC 

AGCCAGCTATCAACTCGCGCCCTGGAAGGGATTTTTGAAGGAACTCATCGATTGATTTACGGC 

GCTAAGGATGACTCTGGTCAGAGATACCTGGCCTGGTCTGGACACAGTGCCCGTGTCGGAGC 

CGCGCGAGATATGGCCCGCGCTGGAGTrrCAATACCGGAGATCATGCAAGCTGGTGGCTGGA 

CCAATGTAAATA7TGTCATGAACTATATCCGTAACCTGGATAGTGAAACAGGGGCAATGGTGC 

GCCTGCTGGAAGATGGCGATrAGGAGCTCGGATCACACGCAAAAAGGAAATTGGAATAAATGC 

GAAAmGAGATGTTAAnAAAGACCTTmGAGGTCTTTTTTTCTTAGATTmGGGGTTAm 

GGGGAGAAAACATAGGG6GGTACTACGACCTCCCCCCTAGGTGTCCATTGTCCATTGTCCAA 

ACAAATAAATAAATATTGGGTTTTTAATGTTAAAAGGTTGT I I I I I ATGTTAAAGTGAAAAAAACA 

GATGTTGGGAGGTAGAGTGATAGTTGTAGATAGAAAAGAAGAGAAAAAAGTTGCTGTTACTTTA 

AGACTTACAACAGAAGAAAATGAGATATTAAATAGAATCAAAGAAAAATATAATATTAGCAAATC 

AGATGCAACCGGTATTCTAATAAAAAAATATGCAAAGGAGGAATACGGTGCATrTTAAACAAAA 

AAAGATAGACAGCACTGGCATGCTQCCTATCTATGACTAAATTTTGTTAAGTGTATTAGCACCG 

TTATTATATCATGAGCGAAAATGTAATAAAAGAAACTGAAAACAAGAAAAATTCAAGAGGACGT

MTTGGACATTTGTTTTATATCCAGAATCAGCAAAAGCCGAGTGGTTAGAGTATTTAAAAGAGT

TACACATTCAATTTGTAGTGTCTCCATTACATGATAGGGATAGTGATACAGAAGGTAGGATGAA 

AAAAGAGGATTATCATATTCTAGTGATGTATGAGGGTAATAAATCTTATGAACAGA^i^^^ 

TTAACAGAAGAATTGAATGCGACTATTCCGCAGATTGCAGGAAGTGTGAAAGGTCTTCTGAGA 

TATATGCTTCACATGGACGATCCTAATAAAmAAATATCAAAAAGAAGATATGAT^^ 

CGGTGTAGATGTTGATGAATTATTAAAGAAAACAACAACAGATAGATATAAATTAATTAAAGAAA 

TGATTGAGTTTATTGATGAACAAGGAATCGTAGAATTTAAGAGTTTAATGGATTATGCAATGAAG 

mAAATTTGATGATTGGTTCCCGCTmATGTGATAACTCGGCGTATGTTATTCAAGAATATAT 

AAAATCAAATCGGTATAAATCTGACCGATAGATTTTGAAmAGGTGTCACAAGACACTC^^ 

TCGCACCAGCGAAAACTGGTTTAAGCCGACTGGAGCTCCTGCACTGGATGGTGGCGCTGGAT 

GGTAAGCCGCTGGCAAGCGGTGAAGTGGCTCTGGATGTCGCTCCACAAGGTAAACAGTTGAT 

TGAACTGCCTGAACTAeCGCAGCCGGAGAGCGCCGGGCAAGTCTGGCTCACAGTACGCGTAG 

TGGAAGCGAACGCGACCGCATGGTCAGAAGCCGGGCACATCAGCGCCTGGCAGCAGTGGCG 

TCTGGCGGAAAACCTCAGTGTGACGCTCCCCGCCGCGTCCCACGCCATCCCGCATCTGACCA 

CCAGCGAAATGGATTmGCATCGAGCTGGGTAATAAGCGTTGGCAAmAACCGCCAGTCA^ 

GCTTTCTTTCACAGATGTGGATTGGCGATAAAAAACAACTGCTGACGCCGCTGCGCGATCAGT 

TCACCCGTGCACCGCTGGATAACGACATTGGCGTAAGTGAAGCGACCCGCATTGACCGTAAC 

GCCTGGGTCGAACGCTGGAAGGCGGCGGGCCATTACCAGGCCGAAGCAGCGTTGTTGCAGT 

GCACGGCAGATACACTTGCTGATGCGGTGCTGATTACGACCGCTCACGCGTGGCAGCATCAG 

GGGAAAAGCTTATTTATCAGCCGGAAAACCTACCGGATTGATGGTAGTGGTGAAATGGCGATr 

ACCGTTGATGTTGAAGTGGCGAGCGATACACCGCATGC6GCGCGGATTGGCCTGAACTGGCA 

GCTGGCGCAGGTAGCAGAGCGGGTAAACTGGCTCGGATTAGGGCCGCAAGAAAACTATCCC 

GACCGCCTTACTGCCGGCTGTTTTGACCGCTGGGATCTGCCATTGTCAGACATGTATACCCCG 

TACGTCTTCGCGAGCGAAAACGGTCTGCGCTGCGGGACGCGCGAATTGAATTATGGCCCACA 

CCAGTGGCGCGGCGACTTCCAGTTCAACATCAGCCGCTACAGTCAACAGCAACTGATGGAAA 

CCAGCGATCGCCATCTGCTGCACGCGGAAGAAGGCACATGGCTGAATATCGACGGTTTCCAT 

ATGGGGATTGGTGGCGACGACTCCTGGAGCCCGTCAGTATCGGCGGAATTCGAGCTGAGCG 

CCGGTCGCTACCATTACCAGTTGGTCTGGTGTCAAAAATAATAATAACCGGGCAGGCCATGTC 

TGCCCGTATTTCGCGTAAGGAAATCCATTATGTACTATTTCAAGCTAATTCCGGTGGAAACGAG 

GTCATCAmCCTTCCGAAAAAACGGTTGCATTTAAATCTTACATATGTAATACTTTCAAAGACT 

ACATTTGTAAGATTTGATGTTTGAGTCGGCTGAAAGATCGTACGTACCAATTATTGTTTCGTGAT 

TGTTCAAGCGATAACACTGTAGGGATAGTGGAAAGAGTGCTrCATCTGGTrACGATCAATCAAA 

TATTCAAACGGAGGGAGACGATTTTGATGAAACCAGTAACGTTATACGATGTCGCAGAGTATG 

CCGGTGTCTCTTATCAGACCGTTTCCCGCGTGGTGAACCAGGCCAGCCACGTTTCTGCGAAAA 

CGCGGGAAAAAGTGGAAGCGGCGATGGCGGAGCTGAATTACATTCCCAACCGCGTGGCACAA 

CAACTGGCGGGCAAACAGTCGTTGCTGATTGGCGTTGCCACCTCCAGTCTGGCCCTGCACGC 

GCCGTCGCAAATTGTCGCGGCGATrAAATCTCGCGCCGATCAACTGGGTGCCAGCGTGGTGG 

TGTCGATGGTAGAACGAAGCGGCGTCGAAGCCTGTAAAGCGGCGGTGCACAATCTTCTCGCG 

CAACGCGTCAGTGGGCTGATCATTAACTATCCGCTGGATGACCAGGATGCCATTGCTGTGGAA 

GCTGCCTGCACTAATGTTCCGGCGTTATTTCTTGATGTCTCTGACCAGACACCCATCAACAGTA 

TTATTTTCTCCCATGAAGACGGTACGCGACTGGGCGTGGAGCATCTGGTCGCATTGGGTCACC 

AGCAAATCGCGCTGTTAGCGGGCCCATTAAGTTCTGTCTGGGCGCGTCTGCGTCTGGCTGGG 

TGGCATAAATATCTCACTCGCAATCAAATTCAGCCGATAGCGGAACGGGAAGGCGACTGGAGT 

GCCATGTCCGGTTTTCAACAAACCATGCAAATGCTGAATGAGGGGATCGTTCCCACTGCGATG. 

CTGGTTGCCAACGATCAGATGGCGCTGGGCGCAATGCGCGCCATTACCGAGTCCGGGCTGC 

GCGTTGGTGCGGATATCTCGGTAGTGGGATACGACGATACCGAAGACAGCTCATGTTATATGC 

CGGCGTCAACCACCATCAAACAGGATTTTCGCCTGCTGGGGCAAACCAGCGTGGACCGCTTG 

CTGCAACTCTCTCAGG6CCAGGCGGTGAAGGGCAATCAGCTGTTGCCCGTCTCACTGGTGAA 

AAGAAAAACCACCCTGGCGCCCAATACGCAAACCGGCTCTCCCGGCGCGTTGGCCGATTCAT 

TAATGCAGCTGGCACGACAGGTTTCCCGACTGGAAAGCGGGCAGTGAGCGCAACGCAATTAA 

TGTGAGTTAGGCATCGCATCCTGCCTCGCGCGTTTCGGTGATGACGGTGAAAACCTCTGACAC 

ATGCAGCTCCCGGAGACGGTCACAGCTrGTCTGTAAGCGGATGCCGGGAGCAGACAAGCGC 

GTCAGGGCGCGTCAGCGGGTGTTGGCGGGTGTCGGGGCGCAGCCATGACCCAGTCACGTAG 

CGATAGCGGAGTGTATACTGGCTTAACTATGCGGCATCAGAGCAGATTGTACTGAGAGTGCAC 

CATATGCGGTGTGAAATACCGCACAGATGCGTAAGGAGAAAATACCGGATCAGGCGGTCTTGC 

GCTTCCTCGCTCACTGACTCGCTGCGCTCGGTCGTTCGGCTGCGGCGAGCGGTATCAGCTCA 

CTGAAAGGCGGTAATACGGTTATCCACAGAATCAGGGGATAACGCAGGAAA6AACATGTGAG 

CAAAAGGCCAGCAAAAGGCCAGGAACCGTAAAAAGGCCGCGTTGCTGGCGTrTTTCCATAGG

GTAAGACACGACTTATCGCCACTGGCAGCAGCCACTGGTMCAGGATTAGCAGAGCGAGGTA

?g?JgScggtg?t^^^ 

ATrTGGTATGTGCGCTCTGGTGAAGCCAGTTACCTTCGG^^ 
cSc^CAMCCACCGCTGGTAGCGGTGGTTTTm 

^SS^GG^rCTCM^^ 
XcTCACGTTMGGGATTrrGGTCATC^^ 

i??S^TGAAGrnTM?rc^ 

tS^^CAGTGAGGCACCTATCTCAGCGATCTC^^^ 

TCCCCGTCGTGTAGATAACTACGATACGGGAGGGCTTACCATCTGGCCCCAGTGCTGCAATG 

ItaSSgSacccacot 

GGCCGA?CG^^^^ 

GGAAGCTAGAGTAAGTAGTTCGCCAGTTAATAGmGCGCAACGTTGTTGCCATTGCTA^^^ 
§S^GTGG?GTCACGCTC^^ 

gcSgttacatgatccc 

fdTCA(SAGTAAGTTGG^^^ 

ACTGTCATGCCATCCGTAAGATGCTmCTGTGACTGGTGAGTACTCAACC^^ 

aSSbtgtatS^cggcgaccgagttgctcttgcccggcgtgaacacgggataatagc 

SATAGC^MCTTTMyS^^^^ 

a?J??acS^tJSsXSaW^ 

crfTTACTTTCACCAGCGTrrCTGGGTGAGCAAAAACAGGAAGGCAAAATGCCGCAAAAAAGG 

gIJS^^cgacacggamtg^^ 

?^fcJSGGTrATTGT^^^ 

GGTTCCGCGCACAmCCCCGAAAAGTGCCACCTGACGTCCAATAGACCAGTT^MTCCAM 

CGAGAGTCTAATAGAATC 
AMTG>Sw3GGG^^ 

attaaaJagagtat^^ 
aScg^ttctaatgtgtaatgaggt^ 

ccg 

E. Transcriptome DNA Array Methods

In addition to the above methods, transcriptome DNA array methods were used in
the development of mutants of the present invention. First, target RNA was harvested
from a Bacillus strain by guanidinium acid phenol extraction as known in the art (See e.g.,
Farrell, RNA Methodologies, (2nd Ed.), Academic Press, San Diego, at p. 81), and RNA from
each time-point was reverse-transcribed into biotin-labeled cDNA by a method adapted from
deSaizieu et al. (deSaizieu et al., J. Bacteriol., 182: 4696-4703 [2000]) and described
herein. Total RNA (25 µg) was incubated at 37°C overnight in a 100-µL reaction: 1x GIBCO
first-strand buffer (50 mM Tris-HCl pH 8.3, 75 mM KCl, 3 mM MgCl2); 10 mM DTT; 40 mM
random hexamer; 0.3 mM each dCTP, dGTP and dTTP; 0.12 mM dATP; 0.3 mM
biotin-dATP (NEN); 2500 units Superscript II reverse-transcriptase (Roche). To remove RNA,
the reaction was brought to 0.25 M NaOH and incubated at 65°C for 30 minutes. The
reaction was neutralized with HCl and the nucleic acid precipitated at
-20°C in ethanol with 2.5 M ammonium-acetate. The pellet was washed, air-dried,
resuspended in water, and quantitated by UV spectroscopy. The reaction yield was
approximately 20-25 µg biotin-labeled cDNA.

Twelve µg of this cDNA were fragmented in 33 µL 1x One-Phor-All buffer
(Amersham-Pharmacia #27-0901-02) with 3.75 milliunits of DNaseI at 37°C for 10
minutes. After heat-killing the DNase, fragmentation was validated by running 2 µg of the
fragmented cDNA on a 3% agarose gel. Biotin-containing cDNA routinely ranged in size
from 25 to 125 nucleotides. The remaining 10 µg of cDNA were hybridized to an
Affymetrix Bacillus GeneChip array.

Hybridizations were performed as described in the Affymetrix Expression Analysis
Technical Manual (Affymetrix) using reagent suppliers as suggested. Briefly, 10 µg of
fragmented biotin-labeled cDNA were added to a 220-µL hybridization cocktail containing:
100 mM MES (N-morpholinoethanesulfonic acid), 1 M Na+, 20 mM EDTA, 0.01% Tween
20; 5 mg/mL total yeast RNA; 0.5 mg/mL BSA; 0.1 mg/mL herring-sperm DNA; 50 pM
control oligonucleotide (AFFX-B1). The cocktails were heated to 95°C for 5 minutes,
cooled to 40°C for 5 minutes, briefly centrifuged to remove particulates, and 200 µL was
injected into each pre-warmed pre-rinsed (1x MES buffer + 5 mg/mL yeast RNA) GeneChip
cartridge. The arrays were rotated at 40°C overnight.

The samples were removed and the arrays were filled with non-stringent wash
buffer (6x SSPE, 0.01% Tween 20) and washed on the Affymetrix fluidics station with
protocol Euk-GE-WS2, using non-stringent and stringent (0.1 M MES, 0.1 M [Na+], 0.01%
Tween 20) wash buffers. Arrays were stained in three steps: (1) streptavidin; (2)
anti-streptavidin antibody tagged with biotin; (3) streptavidin-phycoerythrin conjugate.

The signals in the arrays were detected with the Hewlett-Packard Gene Array
Scanner using 570 nm laser light with 3-µm pixel resolution. The signal intensities of the
4351 ORF probe sets were scaled and normalized across all time points comprising a time
course experiment. These signals were then compared to deduce the relative expression
levels of genes under investigation. The threonine biosynthetic and degradative genes
were simultaneously transcribed, indicating inefficient threonine utilization. Deletion of the
degradative threonine pathway improved expression of the desired product (See, Figure
7). The present invention provides means to modify pathways with transcription profiles
that are similar to threonine biosynthetic and degradative profiles. Thus, the present
invention also finds use in the modification of pathways with transcription profiles similar to
threonine in order to optimize Bacillus strains. In some preferred embodiments, at least
one gene selected from the group consisting of rocA, ycgN, ycgM, rocF and rocD is
deleted or otherwise modified. Using the present invention as described herein resulted in
the surprising discovery that the sigD regulon was transcribed. Deletion of this gene
resulted in better expression of the desired product (See, Figure 7). It was also surprising
to find the transcription of gapB and pckA. Deletion of pckA did not result in improvement
or detriment. However, the present invention provides means to improve strain protein
production through the combination of pckA deletion or modification and deletion or
modification of gapB and/or fbp. In addition, during the development of the present
invention, it was observed that the tryptophan biosynthetic pathway genes showed
unbalanced transcription. Thus, it is contemplated that the present invention will find use
in producing strains that exhibit increased transcription of genes such as those selected
from the group consisting of trpA, trpB, trpC, trpD, trpE, and/or trpF, such that the
improved strains provide improved expression of the desired product, as compared to the
parental (i.e., wild-type and/or originating strain). Indeed, it is contemplated that
modifications of these genes in any combination will lead to improved expression of the
desired product.

F. Fermentations 

Analysis of the strains produced using the above constructs was conducted
following fermentation. Cultures at 14 L scale were conducted in Biolafitte® fermenters.
Media components per 7 liters are listed in Table 9.

Table 9. Media Components per 7L Fermentation

Component                   Concentration   Amount per 7 L
NaH2PO4-H2O                 0.8%            56 g
KH2PO4                      0.8%            56 g
MgSO4-7H2O                  0.28%           19.6 g
antifoam                    0.1%            7 g
CaCl2-2H2O                  0.01%           0.7 g
ferrous sulfate-7H2O        0.03%           2.1 g
MnCl2-4H2O                  0.02%           1.4 g
trace metals 100x stock*    1%              70 g
H2SO4                       0.16%           11.2 g
60% glucose                 1.29%           90

*See, Harwood and Cutting, supra, at p. 649

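The amounts in Table 9 are consistent with each percentage being applied to a 7000 g batch. The following Python sketch recomputes the amounts under that assumption, which is inferred from the numbers and not stated in the text. The final entry is listed as 90 without a unit and is treated here as grams.

```python
# Sketch: recompute the Table 9 amounts from the listed percentages.
# Assumption (not stated in the source): percentages are by weight of a
# 7000 g batch, i.e. 7 L taken as 7000 g.

BATCH_GRAMS = 7000.0

# (component, percent as listed, amount as listed)
TABLE_9 = [
    ("NaH2PO4-H2O", 0.8, 56.0),
    ("KH2PO4", 0.8, 56.0),
    ("MgSO4-7H2O", 0.28, 19.6),
    ("antifoam", 0.1, 7.0),
    ("CaCl2-2H2O", 0.01, 0.7),
    ("ferrous sulfate-7H2O", 0.03, 2.1),
    ("MnCl2-4H2O", 0.02, 1.4),
    ("trace metals 100x stock", 1.0, 70.0),
    ("H2SO4", 0.16, 11.2),
    ("60% glucose", 1.29, 90.0),  # unit assumed to be grams
]


def grams_from_percent(percent: float, batch_grams: float = BATCH_GRAMS) -> float:
    """Convert a percentage of the batch weight into grams."""
    return percent / 100.0 * batch_grams


def mismatches(rows, tolerance: float = 0.5):
    """Return rows whose listed amount differs from the computed amount."""
    return [row for row in rows
            if abs(grams_from_percent(row[1]) - row[2]) > tolerance]


if __name__ == "__main__":
    for name, percent, listed in TABLE_9:
        print(f"{name:26s} {percent:5.2f}%  computed {grams_from_percent(percent):6.1f} g"
              f"  listed {listed:6.1f}")
    print("rows outside tolerance:", mismatches(TABLE_9))
```

The glucose row computes to 90.3, which the table appears to round to 90.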
The tanks were stirred at 750 rpm and airflow was adjusted to 11 liters per minute,
the temperature was 37°C, and the pH was maintained at 6.8 using NH4OH. A 60%
glucose solution was fed starting at about 14 hours in a linear ramp from 0.5 to 2.1 grams
per minute to the end of the fermentation. Off-gasses were monitored by mass
spectrometry. Carbon balance and efficiency were calculated from glucose fed, yield of
protein product, cell mass yield, other carbon in broth, and CO2 evolved. A mutant strain
was compared to parent strain to judge improvements. Although this mutant pckA strain
did not show improvement under these conditions, it is contemplated that improvements
will be produced under modified culture conditions (i.e., as known to those in the art),
and/or incorporation of additional genes. In some preferred embodiments, these
additional genes are selected from the group consisting of gapB, alsD, and/or fbp.
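
The linear feed ramp described above can be written out as a small calculation. The following Python sketch is illustrative only: the end time of the fermentation is not given in the text, so `end_h` is a hypothetical parameter, and the rates are assumed to refer to grams of the 60% glucose solution per minute.

```python
# Sketch: linear glucose feed ramp from 0.5 to 2.1 g/min starting at 14 h.
# Assumptions: the fermentation end time (end_h) is hypothetical, and the
# rate refers to grams of 60% glucose solution per minute.

START_H = 14.0
RATE_START = 0.5  # g/min at the start of feeding
RATE_END = 2.1    # g/min at the end of the fermentation


def feed_rate(t_h: float, end_h: float) -> float:
    """Feed rate in g/min at elapsed time t_h (hours)."""
    if t_h < START_H:
        return 0.0
    if t_h >= end_h:
        return RATE_END
    fraction = (t_h - START_H) / (end_h - START_H)
    return RATE_START + fraction * (RATE_END - RATE_START)


def total_solution_fed(end_h: float) -> float:
    """Grams of solution fed over the whole ramp (area of a trapezoid)."""
    minutes = (end_h - START_H) * 60.0
    return (RATE_START + RATE_END) / 2.0 * minutes


if __name__ == "__main__":
    end_h = 40.0  # hypothetical run length, not taken from the source
    print("rate at 27 h (g/min):", round(feed_rate(27.0, end_h), 3))
    print("solution fed (g):", round(total_solution_fed(end_h), 1))
    print("glucose fed (g):", round(0.6 * total_solution_fed(end_h), 1))
```

A figure of this kind would be the "glucose fed" term entering the carbon balance.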

EXAMPLE 4 

Host Cell Transformation To Obtain An Altered Bacillus Strain

Once the DNA construct was created by Method 1 or 2 as described above, it was
transformed into a suitable Bacillus subtilis lab strain (e.g., BG2036 or BG2097; any
competent Bacillus immediate host cell may be used in the methods of the present
invention). The cells were plated on a selective media of 0.5 ppm phleomycin or 100 ppm
spectinomycin as appropriate (Ferrari and Miller, Bacillus Expression: A Gram-Positive
Model in Gene Expression Systems: Using Nature for the Art of Expression, pgs 65-94
[1999]). The laboratory strains were used as a source of chromosomal DNA carrying the
deletion that was transformed into a Bacillus subtilis production host strain twice or
BG3594 and then MDT 98-113 once. Transformants were streaked to isolate a single
colony, picked and grown overnight in 5 mL of LB plus the appropriate antimicrobial.
Chromosomal DNA was isolated as known in the art (See e.g., Harwood et al., supra).

The presence of the integrated DNA construct was confirmed by three PCR
reactions, with components and conditions as described above. For example, two
reactions were designed to amplify a region from outside the deletion cassette into the
antimicrobial gene in one case (primers 1 and 11) and through the entire insert in another
(primers 1 and 12). A third check amplified a region from outside the deletion cassette
into the deleted region (primers 1 and 4). Figure 4 shows that a correct clone showed a
band in the first two cases but not the third. Wild-type Bacillus subtilis chromosomal DNA
was used as a negative control in all reactions, and should only amplify a band with the
third primer set.
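
The three-reaction check described above amounts to a simple decision rule on the observed band pattern. The following Python sketch states that rule explicitly. The function and label names are hypothetical, and any pattern other than the two described in the text is reported as inconclusive instead of being interpreted.

```python
# Sketch: interpret the three confirmation PCRs described above.
# Reaction 1: outside the cassette into the antimicrobial gene (primers 1 and 11)
# Reaction 2: through the entire insert (primers 1 and 12)
# Reaction 3: outside the cassette into the deleted region (primers 1 and 4)
# Names and return labels are hypothetical.

def classify_clone(band_marker: bool, band_insert: bool, band_deleted: bool) -> str:
    """Classify a clone from the presence or absence of each PCR band."""
    pattern = (band_marker, band_insert, band_deleted)
    if pattern == (True, True, False):
        return "deletion"      # cassette present, deleted region absent
    if pattern == (False, False, True):
        return "wild-type"     # pattern expected for the negative control
    return "inconclusive"      # any other pattern needs repeating


if __name__ == "__main__":
    print(classify_clone(True, True, False))
    print(classify_clone(False, False, True))
    print(classify_clone(True, False, True))
```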

EXAMPLE 5 

Shake Flask Assays - Measurement of Protease Activity.

Once the DNA construct was stably integrated into a competent Bacillus subtilis
strain, the subtilisin activity was measured by shake flask assays and the activity was
compared to wild type levels. Assays were performed in 250 mL baffled flasks containing
50 mL of growth media suitable for subtilisin production as known in the art (See,
Christianson et al., Anal. Biochem., 223:119-129 [1994]; and Hsia et al., Anal. Biochem.
242:221-227 [1996]). The media were inoculated with 50 µL of an 8 hour 5 mL culture
and grown for 40 hours at 37°C with shaking at 250 RPM. Then, 1 mL samples were
taken at 17, 24 and 40 hours for protease activity assays. Protease activity was measured
at 405 nm using the Monarch Automatic Analyser. Samples in duplicate were diluted 1:11
(3.131 g/L) in buffer. As a control to ensure correct machine calibration one sample was
diluted 1:6 (5.585 g/L), 1:12 (2.793 g/L) and 1:18 (1.862 g/L). Figure 7 illustrates the
protease activity in various altered Bacillus subtilis clones. Figure 8 provides a graph
showing improved protease secretion as measured from shake flask cultures in Bacillus
subtilis wild-type strain (unaltered) and corresponding altered deletion strains (-sbo) and
(-slr). Protease activity (g/L) was measured after 17, 24 and 40 hours.

Cell density was also determined using spectrophotometric measurement at an OD
of 600. No significant differences were observed for the samples at the measured time
(data not shown).

All publications and patents mentioned in the above specification are herein
incorporated by reference. Various modifications and variations of the described method
and system of the invention will be apparent to those skilled in the art without departing
from the scope and spirit of the invention. Although the invention has been described in
connection with specific preferred embodiments, it should be understood that the invention
as claimed should not be unduly limited to such specific embodiments. Indeed, various
modifications of the described modes for carrying out the invention that are obvious to
those skilled in the art and/or related fields are intended to be within the scope of the
present invention.

CLAIMS

1. A method for enhancing expression of a protein of interest from Bacillus
comprising:

a) obtaining an altered Bacillus strain capable of producing a protein of
interest, wherein said altered Bacillus strain has at least one inactivated chromosomal
gene selected from the group consisting of sbo, slr, ybcO, csn, spoIISA, sigB, phrC,
rapA, CssS, trpA, trpB, trpC, trpD, trpE, trpF, tdh/kbl, alsD, sigD, prpC, gapB, pckA,
fbp, rocA, ycgN, ycgM, rocF, and rocD; and

b) growing said altered Bacillus strain under conditions such that said
protein of interest is expressed by said altered Bacillus strain, wherein said
expression of said protein of interest is enhanced compared to the expression of said
protein of interest in an unaltered Bacillus host strain.

2. The method of Claim 1, wherein said protein of interest is selected from the 
group consisting of homologous proteins and heterologous proteins. 

3. The method of Claim 1, wherein said protein of interest is an enzyme selected 
from the group consisting of proteases, cellulases, amylases, carbohydrases, lipases, 
isomerases, transferases, kinases, and phosphatases. 

4. The method of Claim 3, wherein said protein of interest is a protease. 

5. The method of Claim 1, wherein said altered Bacillus strain is obtained 
by deleting one or more chromosomal genes selected from the group consisting of sbo, slr, 
ybcO, csn, spoIISA, sigB, phrC, rapA, CssS, trpA, trpB, trpC, trpD, trpE, trpF, tdh/kbl, alsD, 
sigD, prpC, gapB, pckA, fbp, rocA, ycgN, ycgM, rocF, and rocD. 

6. An altered Bacillus strain obtained using the method of Claim 1. 

7. An altered Bacillus strain comprising a chromosomal deletion of one or more 
genes selected from the group consisting of sbo, slr, ybcO, csn, spoIISA, sigB, phrC, rapA, 
CssS, trpA, trpB, trpC, trpD, trpE, trpF, tdh/kbl, alsD, sigD, prpC, gapB, pckA, fbp, rocA, 
ycgN, ycgM, rocF, and rocD. 

8. The altered Bacillus strain of Claim 7, wherein said altered strain is a B. 
subtilis strain. 

9. The altered Bacillus strain of Claim 7, wherein said altered Bacillus strain is a 
protease producing strain. 

10. The altered Bacillus strain of Claim 9, wherein said protease is a subtilisin. 

11. The altered Bacillus strain of Claim 11, wherein said subtilisin is selected from 
the group consisting of subtilisin 168, subtilisin BPN', subtilisin Carlsberg, subtilisin DY, 
subtilisin 147, subtilisin 309 and variants thereof. 

12. The altered Bacillus strain of Claim 7, wherein said altered Bacillus strain 
further comprises a mutation in a gene selected from the group consisting of degU, degQ, 
degS, scoC4, spoIIE, and oppA. 

13. The altered Bacillus strain of Claim 7, wherein said altered Bacillus strain 
further comprises a heterologous protein of interest. 

14. A DNA construct comprising at least one gene selected from the group 
consisting of sbo, slr, ybcO, csn, spoIISA, sigB, phrC, rapA, CssS, trpA, trpB, trpC, trpD, trpE, 
trpF, tdh/kbl, alsD, sigD, prpC, gapB, pckA, fbp, rocA, ycgN, ycgM, rocF, and rocD, gene 
fragments thereof, and homologous sequences thereto. 

15. The DNA construct of Claim 14, wherein said at least one gene comprises at 
least one nucleic acid sequence selected from the group consisting of SEQ ID NO: 1, SEQ 
ID NO: 3, SEQ ID NO: 5, SEQ ID NO: 7, SEQ ID NO: 9, SEQ ID NO: 11, SEQ ID NO: 13, 
SEQ ID NO: 15, SEQ ID NO:17, SEQ ID NO:39, SEQ ID NO:40, SEQ ID NO:42, SEQ ID 
NO:44, SEQ ID NO:46, SEQ ID NO:48, SEQ ID NO:50, SEQ ID NO:37, SEQ ID NO:25, SEQ 
ID NO:21, SEQ ID NO:50, SEQ ID NO:29, SEQ ID NO:23, SEQ ID NO:27, SEQ ID NO:19, 
SEQ ID NO:31, SEQ ID NO:48, SEQ ID NO:46, SEQ ID NO:35, and SEQ ID NO:33. 

16. The DNA construct of Claim 14, wherein said construct further comprises a 
polynucleotide sequence encoding a protein of interest. 

17. A plasmid comprising the DNA construct of Claim 16. 

18. A host cell comprising the plasmid of Claim 17. 

19. The host cell of Claim 18, wherein said host cell is selected from the group 
consisting of Bacillus cells and E. coli cells. 

20. The host cell of Claim 19, wherein said host cell is B. subtilis. 

21. The host cell of Claim 18, wherein said DNA construct has been integrated 
into the chromosome of said host cell. 

22. The DNA construct of Claim 14, wherein said at least one gene encodes at 
least one amino acid sequence selected from the group consisting of SEQ ID NO: 2, SEQ ID 
NO: 4, SEQ ID NO: 6, SEQ ID NO: 8, SEQ ID NO: 10, SEQ ID NO: 12, SEQ ID NO: 14, SEQ 
ID NO: 16, SEQ ID NO:18, SEQ ID NO:41, SEQ ID NO:43, SEQ ID NO:45, SEQ ID NO:47, 
SEQ ID NO:49, SEQ ID NO:51, SEQ ID NO:38, SEQ ID NO:26, SEQ ID NO:22, SEQ ID 
NO:57, SEQ ID NO:30, SEQ ID NO:24, SEQ ID NO:28, SEQ ID NO:20, SEQ ID NO:32, SEQ 
ID NO:55, SEQ ID NO:53, SEQ ID NO:36, and SEQ ID NO:34. 

23. The DNA construct of Claim 14, further comprising a selective marker, 
wherein the selective marker is flanked on each side by a fragment of said gene or 
homologous gene sequence thereto. 

24. A DNA construct comprising an incoming sequence, wherein said incoming 
sequence comprises a nucleic acid encoding a protein of interest, and a selective marker 
flanked on each side with a homology box, wherein said homology box includes nucleic acid 
sequences having 80 to 100% sequence identity to the sequence immediately flanking the 
coding regions of at least one gene selected from the group consisting of sbo, slr, ybcO, csn, 
spoIISA, sigB, phrC, rapA, CssS, trpA, trpB, trpC, trpD, trpE, trpF, tdh/kbl, alsD, sigD, prpC, 
gapB, pckA, fbp, rocA, ycgN, ycgM, rocF, and rocD. 

25. The DNA construct of Claim 24, further comprising at least one nucleic acid 
which flanks the coding sequence of said gene. 

26. A plasmid comprising the DNA construct of Claim 25. 

27. A host cell comprising the plasmid of Claim 26. 

28. The host cell of Claim 27, wherein said host cell is selected from the group 
consisting of Bacillus cells and E. coli cells. 

29. The host cell of Claim 28, wherein said host cell is B. subtilis. 

30. The host cell of Claim 26, wherein said DNA construct has been integrated 
into the host cell chromosome. 

31. The host cell of Claim 30, wherein said selective marker has been excised 
from said host cell chromosome. 

32. A method for obtaining an altered Bacillus strain with enhanced protease 
production comprising: 

a) transforming a Bacillus host cell with the DNA construct of Claim 14, 
wherein said protein of interest in said DNA construct is a protease, and wherein said 
DNA construct is integrated into the chromosome of the Bacillus host cell under 
conditions such that said at least one gene is inactivated to produce an altered 
Bacillus strain; and 

b) growing said altered Bacillus strain under conditions such that 
enhanced protease production is obtained. 

33. The method of Claim 32, further comprising recovering said protease. 

34. The method of Claim 32, wherein said at least one inactivated gene is deleted 
from the chromosome of said altered Bacillus strain. 

35. An altered Bacillus strain produced using the method of Claim 32. 

36. The method of Claim 33, wherein said Bacillus host strain is selected from the 
group consisting of B. licheniformis, B. lentus, B. subtilis, B. amyloliquefaciens, B. brevis, B. 
stearothermophilus, B. alkalophilus, B. coagulans, B. circulans, B. pumilus, B. lautus, B. 
clausii, B. megaterium, and B. thuringiensis. 

37. The method of Claim 36, wherein said Bacillus host cell is B. subtilis. 

38. A method for enhancing expression of a protease in an altered Bacillus 
comprising: 

a) transforming a Bacillus host cell with the DNA construct of Claim 24; 

b) allowing homologous recombination of said DNA construct and a 
region of the chromosome of said Bacillus host cell, wherein at least one gene of said 
chromosome of said Bacillus host cell is inactivated, to produce an altered Bacillus 
strain; and 

c) growing said altered Bacillus strain under conditions suitable for the 
expression of said protease, wherein the production of said protease is greater in the 
altered Bacillus subtilis strain compared to said Bacillus subtilis host prior to 
transformation in step a). 

39. The method of Claim 38, wherein said protease is subtilisin. 

40. The method of Claim 38, wherein said protease is a recombinant protease. 

41. The method of Claim 38, wherein said inactivation is by deletion of at least 
one of said genes. 

42. The method of Claim 38, wherein said inactivation is by insertional 
inactivation of said at least one of said genes. 

43. The altered Bacillus strain obtained using the method of Claim 38. 

44. The altered Bacillus strain of Claim 38, wherein said altered Bacillus strain 
comprises at least one inactivated gene selected from the group consisting of sbo, slr, ybcO, 
csn, spoIISA, sigB, phrC, rapA, CssS, trpA, trpB, trpC, trpD, trpE, trpF, tdh/kbl, alsD, sigD, 
prpC, gapB, pckA, fbp, rocA, ycgN, ycgM, rocF, and rocD. 

45. The altered Bacillus strain of Claim 44, wherein said inactivated gene has 
been inactivated by deletion. 

46. The altered Bacillus strain of Claim 44, further comprising at least one 
mutation in a gene selected from the group consisting of degU, degS, degQ, scoC4, 
spoIIE, and oppA. 

47. The altered Bacillus strain of Claim 46, wherein said mutation is 
degU(Hy)32. 

48. The altered Bacillus strain of Claim 44, wherein said strain is a recombinant 
protease producing strain. 

49. The altered Bacillus strain of Claim 44, wherein said altered Bacillus strain is 
selected from the group consisting of B. licheniformis, B. lentus, B. subtilis, B. 
amyloliquefaciens, B. brevis, B. stearothermophilus, B. alkalophilus, B. coagulans, B. 
circulans, B. pumilus, B. lautus, B. clausii, B. megaterium, and B. thuringiensis. 

50. An altered Bacillus strain comprising a deletion of one or more indigenous 
chromosomal regions or fragments thereof, wherein said indigenous chromosomal region 
includes about 0.5 to 500 kb, and wherein said altered Bacillus strain has an enhanced 
level of expression of a protein of interest compared to a corresponding unaltered Bacillus 
strain when said altered and unaltered Bacillus strains are grown under essentially the 
same growth conditions. 

51. The altered Bacillus strain of Claim 50, wherein said altered Bacillus strain 
is selected from the group consisting of B. licheniformis, B. lentus, B. subtilis, B. 
amyloliquefaciens, B. brevis, B. stearothermophilus, B. alkalophilus, B. coagulans, B. 
circulans, B. pumilus, B. lautus, B. clausii, B. megaterium, and B. thuringiensis. 

52. The altered Bacillus strain of Claim 51, wherein said altered Bacillus strain 
is selected from the group consisting of B. subtilis, B. licheniformis, and B. 
amyloliquefaciens. 

53. The altered Bacillus strain of Claim 52, wherein said altered Bacillus strain 
is a B. subtilis strain. 

54. The altered Bacillus strain of Claim 50, wherein said indigenous 
chromosomal region is selected from the group consisting of a PBSX region, a skin region, 
a prophage 7 region, an SPβ region, a prophage 1 region, a prophage 2 region, a prophage 4 
region, a prophage 3 region, a prophage 4 region, a prophage 5 region, a prophage 6 
region, a PPS region, a PKS region, a YVFF-YVEK region, a DHB region and fragments 
thereof. 

55. The altered Bacillus strain of Claim 50, wherein two indigenous chromosomal 
regions or fragments thereof have been deleted. 

56. The altered Bacillus strain of Claim 50, wherein said protein of interest is a 
protease. 

57. The altered Bacillus strain of Claim 56, wherein said protease is a subtilisin. 

58. The altered Bacillus strain of Claim 57, wherein said subtilisin is selected from 
the group consisting of subtilisin 168, subtilisin BPN', subtilisin Carlsberg, subtilisin DY, 
subtilisin 147 and subtilisin 309 and variants thereof. 

59. The altered Bacillus strain of Claim 50, wherein said Bacillus host is a 
recombinant strain. 

60. The altered Bacillus strain of Claim 50, further comprising at least one 
mutation in a gene selected from the group consisting of degU, degQ, degS, scoC4, spoIIE 
and oppA. 

61. A protease producing Bacillus strain comprising a deletion of an indigenous 
chromosomal region selected from the group consisting of a PBSX region, a skin region, a 
prophage 7 region, an SPβ region, a prophage 1 region, a prophage 2 region, a prophage 3 
region, a prophage 4 region, a prophage 5 region, a prophage 6 region, a PPS region, a PKS 
region, a YVFF-YVEK region, a DHB region and fragments thereof. 

62. The protease producing Bacillus strain of Claim 61, wherein said protease is a 
subtilisin. 

63. The protease producing Bacillus strain of Claim 62, wherein said Bacillus is a 
B. subtilis strain. 

64. The protease producing Bacillus strain of Claim 61, wherein said protease is a 
heterologous protease. 

65. A method for enhancing the expression of a protein of interest in Bacillus 
comprising: 

a) introducing a DNA construct including a selective marker and an 
inactivating chromosomal segment into a Bacillus host strain, wherein said DNA 
construct is integrated into the chromosome of said Bacillus host strain, resulting in 
the deletion of an indigenous chromosomal region or fragment thereof from said 
Bacillus host cell to produce an altered Bacillus strain; and 

b) growing said altered Bacillus strain under suitable conditions, wherein 
expression of a protein of interest is greater in the altered Bacillus strain compared to 
the expression of the protein of interest in a Bacillus host cell that has not been 
altered. 

66. The method of Claim 65, further comprising recovering said protein of interest. 

67. The method of Claim 65, further comprising the step of excising said selective 
marker from the altered Bacillus strain. 

68. The method of Claim 65, wherein said indigenous chromosomal region is 
selected from the group of regions consisting of PBSX, SKIN, prophage 7, SPβ, prophage 1, 
prophage 2, prophage 3, prophage 4, prophage 5, prophage 6, PPS, PKS, YVFF-YVEK, 
DHB and fragments thereof. 

69. The method of Claim 65, wherein said altered Bacillus strain comprises 
deletion of at least two indigenous chromosomal regions. 

70. The method of Claim 65, wherein said protein of interest is an enzyme. 

71. The method of Claim 65, wherein said Bacillus host strain is selected from the 
group consisting of B. licheniformis, B. lentus, B. subtilis, B. amyloliquefaciens, B. brevis, B. 
stearothermophilus, B. clausii, B. alkalophilus, B. coagulans, B. circulans, B. pumilus and B. 
thuringiensis. 

72. An altered Bacillus strain produced using the method of Claim 65. 

73. A method for obtaining a protein of interest from a Bacillus strain comprising: 

a) transforming a Bacillus host cell with a DNA construct 
comprising a selective marker and an inactivating chromosomal segment, wherein 
said DNA construct is integrated into the chromosome of the Bacillus strain resulting 
in deletion of an indigenous chromosomal region or fragment thereof, to produce an 
altered Bacillus strain, 

b) culturing said altered Bacillus strain under suitable growth 
conditions to allow the expression of a protein of interest, and 

c) recovering said protein of interest. 

74. The method of Claim 73, wherein said protein of interest is an enzyme. 

75. The method of Claim 73, wherein said Bacillus host comprises a heterologous 
gene encoding a protein of interest. 

76. The method of Claim 73, wherein said Bacillus host cell is selected from the 
group consisting of B. licheniformis, B. lentus, B. subtilis, B. amyloliquefaciens, B. brevis, 
B. stearothermophilus, B. clausii, B. alkalophilus, B. coagulans, B. circulans, B. pumilus 
and B. thuringiensis. 

77. The method of Claim 73, wherein said indigenous chromosomal region is 
selected from the group of regions consisting of PBSX, SKIN, prophage 7, SPβ, prophage 
1, prophage 2, prophage 3, prophage 4, prophage 5, prophage 6, PPS, PKS, YVFF-YVEK, 
DHB and fragments thereof. 

78. The method of Claim 77, wherein said altered Bacillus strain further 
comprises at least one mutation in a gene selected from the group consisting of degU, degQ, 
degS, scoC4, spoIIE and oppA. 

79. The method of Claim 73, wherein said protein of interest is an enzyme 
selected from the group consisting of proteases, cellulases, amylases, carbohydrases, 
lipases, isomerases, transferases, kinases, and phosphatases. 

80. The method of Claim 79, wherein said enzyme is a protease. 

81. A method for enhancing the expression of a protein of interest in Bacillus 
comprising: 

a) obtaining nucleic acid from at least one Bacillus cell; 

b) performing transcriptome DNA array analysis on said nucleic acid 
from said Bacillus cell to identify at least one gene of interest; 

c) modifying said at least one gene of interest to produce a DNA 
construct; 

d) introducing said DNA construct into a Bacillus host cell to produce an 
altered Bacillus strain, wherein said altered Bacillus strain is capable of 
producing a protein of interest, under conditions such that expression of said 
protein of interest is enhanced as compared to the expression of said protein 
of interest in a Bacillus that has not been altered. 

82. The method of Claim 81, wherein said protein of interest is associated 
with at least one biochemical pathway selected from the group consisting of amino 
acid biosynthetic pathways and biodegradative pathways. 

83. The method of Claim 82, wherein said biodegradative pathway is 
disabled by transcription of said gene of interest. 

84. A method for enhancing the expression of a protein of interest in 
Bacillus, comprising: 

a) obtaining nucleic acid containing at least one gene of interest 
from at least one Bacillus cell; 

b) fragmenting said nucleic acid; 

c) amplifying said fragments to produce a pool of amplified 
fragments comprising said at least one gene of interest; 

d) ligating said amplified fragments to produce a DNA construct; 

e) directly transforming said DNA construct into a Bacillus host 
cell to produce an altered Bacillus strain; 

f) culturing said altered Bacillus strain under conditions such that 
expression of said protein of interest is enhanced as compared to the expression of 
said protein of interest in a Bacillus that has not been altered. 

85. The method of Claim 84, wherein said altered Bacillus strain comprises a 
modified gene selected from the group consisting of prpC, sigD and tdh/kbl. 

86. An isolated nucleic acid comprising the sequence set forth in a nucleic acid 
sequence selected from the group consisting of SEQ ID NO: 1, SEQ ID NO: 3, SEQ ID NO: 
5, SEQ ID NO: 7, SEQ ID NO: 9, SEQ ID NO: 11, SEQ ID NO: 13, SEQ ID NO: 15, SEQ ID 
NO:39, SEQ ID NO:40, SEQ ID NO:42, SEQ ID NO:44, SEQ ID NO:46, SEQ ID NO:48, SEQ 
ID NO:50, SEQ ID NO:37, SEQ ID NO:25, SEQ ID NO:21, SEQ ID NO:50, SEQ ID NO:23, 
SEQ ID NO:27, SEQ ID NO:19, SEQ ID NO:31, SEQ ID NO:48, SEQ ID NO:46, SEQ ID 
NO:35, and SEQ ID NO:33. 

87. An isolated nucleic acid sequence encoding an amino acid, wherein said 
amino acid is selected from the group consisting of SEQ ID NO: 2, SEQ ID NO: 4, SEQ ID 
NO: 6, SEQ ID NO: 8, SEQ ID NO: 10, SEQ ID NO: 12, SEQ ID NO: 14, SEQ ID NO: 16, 
SEQ ID NO:41, SEQ ID NO:43, SEQ ID NO:45, SEQ ID NO:47, SEQ ID NO:49, SEQ ID 
NO:51, SEQ ID NO:38, SEQ ID NO:26, SEQ ID NO:22, SEQ ID NO:57, SEQ ID NO:24, SEQ 
ID NO:28, SEQ ID NO:20, SEQ ID NO:32, SEQ ID NO:55, SEQ ID NO:53, SEQ ID NO:36, 
and SEQ ID NO:34. 

88. An isolated amino acid sequence, wherein said amino acid sequence is 
selected from the group consisting of SEQ ID NO: 2, SEQ ID NO: 4, SEQ ID NO: 6, SEQ ID 
NO: 8, SEQ ID NO: 10, SEQ ID NO: 12, SEQ ID NO: 14, SEQ ID NO: 16, SEQ ID NO:41, 
SEQ ID NO:43, SEQ ID NO:45, SEQ ID NO:47, SEQ ID NO:49, SEQ ID NO:51, SEQ ID 
NO:38, SEQ ID NO:26, SEQ ID NO:22, SEQ ID NO:57, SEQ ID NO:24, SEQ ID NO:28, SEQ 
ID NO:20, SEQ ID NO:32, SEQ ID NO:55, SEQ ID NO:53, SEQ ID NO:36, and SEQ ID 
NO:34. 